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Preface 



Many standard numerical algorithms for fluid dynamics have their roots 
in astrophysics; the reasons are probably twofold. First, since astrophysical 
objects are normally not amenable to experimental studies, astrophysicists 
seek some understanding by simulating them by methods of computational 
physics. With the advancement of computer technology and numerical al- 
gorithms, complex astrophysical phenomena such as supernova explosions, 
accretion of material onto a star, stellar pulsations, or the granular pattern 
of solar convection are now accessible to simulation almost as if they were ap- 
proachable via experiments in the laboratory. Second, the extreme conditions 
that prevail in astrophysical flows such as extremely high shock strengths, or 
very high compressibility of the fluid lead to the development of novel nu- 
merical methods that would normally not have been required for terrestrial 
engineering applications. 

Computational astrophysics has become an important branch of astro- 
physical research and many students and researchers of astrophysics are at 
some point in their career confronted with computer simulation results, or 
the prospect of executing simulation calculations, or even the writing of a 
simulation code. Considering the wealth of numerical schemes and computer 
codes available for astrophysical fluid flow, for novices it is not easy to assess 
and evaluate simulation results or to choose the correct scheme and to avoid 
at least “commonly known” pitfalls when carrying out their own simulations. 
The present book should provide some help in such circumstances. More- 
over, it might also prove to be a valuable reference for the more experienced 
computational astrophysicist. 

The first part of this book leads directly to the most modern numerical 
techniques for compressible fluid flow, with special consideration given to as- 
trophysical applications. Emphasis is put on high-resolution shock-capturing 
finite-volume schemes based on Riemann solvers. Examples of unphysical 
solutions resulting from lackluster methods and incorrect applications are 
discussed, as well as more advanced topics such as MHD-Riemann solver 
or computational methods for (general) relativistic fluid flow. An extensive 
literature list leads the reader to specific topics. 

The applications of such schemes, in particular the PPM method, is por- 
trayed in the last part of the book (the colorful finale). Examples of large-scale 
simulations include supernova explosions by core collapse and thermonuclear 
burning and astrophysical jets. They demonstrate the interplay of observa- 




VI 



tions with simulations and exemplify the deeper physical understanding of 
astrophysical objects that can be gained through computer simulations. 

For most astrophysical fluid flow, radiation transfer needs to be (should 
be) taken into account, hence part two and three, which treat the daring 
subject of radiation hydrodynamics. Dimitri Mihalas presents - in a light- 
hearted style - the basic equations of “radiation hydro” that were otherwise 
only accessible in his voluminous “Foundations of Radiation Hydrodynamics” 
(now out of print). Mihalas’ contribution is largely identical to his handwrit- 
ten notes that he uses in teaching radiation hydrodynamics at the University 
of Illinois at Urbana Champaign. The numerical implementations of these 
equations and simulation examples are the subject of part three. The power 
of adaptive (moving) grids, which are capable locally refining the resolution 
by several orders of magnitude, is demonstrated with a number of stellar- 
physical simulations showing very crispy shock-front structures. Interestingly, 
these powerful radiation hydrodynamic codes are based on more traditional 
finite volume techniques using artificial viscosity and it seems that Riemann 
solver methods have not yet been applied in this difficult field of computa- 
tional astrophysics. 

This book is a written version of the the lectures delivered by Ernst Dorfi, 
Randall LeVeque, and Ewald Muller during the 27th “Saas-Fee Advanced 
Course”. Dimitri Mihalas, who was scheduled as a lecturer, had to withdraw 
shortly before the course began because of an accident which forced him 
to remain in hospital for several weeks. The organizers of the course were 
very glad when Ernst Dorfi accepted to step in as a lecturer on numerical 
radiation hydrodynamics and when Dimitri Mihalas offered to contribute 
his (not delivered) lectures to the present book. Ernst Dorfi wrote a more 
comprehensive text on the subject of his ad hoc lecture. 

The 27th “Saas-Fee Advanced Course” of the Swiss Society for Astro- 
physics and Astronomy took place in Les Diablerets, a small village in the 
Swiss Alps, during March 3-8, 1997. Ninety participants from 14 countries 
attended the course. The Eurotel provided a much appreciated hospitality. 
As can be deduced from the photographs in the book, sun and snow were 
also abundant and appropriately enjoyed. 

This course would not have been possible without the financial contribu- 
tions from the Swiss Society for Astrophysics and Astronomy (through the 
Swiss Academy of Sciences) which are gratefully acknowledged. O.S. would 
like to acknowledge the generous support by the High Altitude Observatory 
of the National Center for Atmospheric Research. A.G. was financially sup- 
ported by the Swiss National Science Foundation through a PROFIL2 fellow- 
ship. Last but not least we are grateful to Wolfgang Loffler for professionally 
maintaining the Saas-Fee Web site. 



Freiburg i.Br., Basel 
May, 1998 



Oskar Steiner 
Alfred Gautschy 
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Nonlinear Conservation Laws 
and Finite Volume Methods 

Randall J. LeVeque 

Department of Applied Mathematics and Department of Mathematics, University 
of Washington, Box 352420, Seattle, WA 98195-2420, rjl@amath.washington.edu 



1. Introduction 

These notes are primarily concerned with the numerical solution to hy- 
perbolic systems of conservation laws that arise in astrophysical calculations, 
primarily in “gas dynamic” simulations. The Euler equations are one fun- 
damental system that will be discussed in detail, along with generalizations to 
magnetohydrodynamics (MHD) and relativistic flows, and to include source 
terms modeling effects such as combustion, radiation, or gravity. 

The Euler equations are a nonlinear hyperbolic system of equations that 
arise from the compressible Navier-Stokes equations by neglecting viscosity 
and heat conduction. Mathematically the most interesting feature of such 
systems is that they admit shock waves in their solution, discontinuities in 
the solution that can form even from smooth initial data. These are math- 
ematical idealizations of the steep gradients that can be present in smooth 
solutions to the full Navier-Stokes equations, where rapid changes occur over 
very thin zones (relative to the spatial dimensions that must be modeled). 
Often the width of these “viscous shock layers” is much smaller than the dis- 
tance between grid points on a reasonable compuational grid, in which case 
we cannot possibly model the structure in these layers and are satisfied to use 
a model that produces a good approximation to the macroscopic behavior. 
There is a rich mathematical structure to the hyperbolic equations that can 
be exploited to develop numerical methods that are very effective in spite of 
the fact that the computed solution should be nearly discontinuous. Com- 
putationally, solving the hyperbolic approximation also has the advantage 
that explicit methods rather than implicit methods can typically be used, 
resulting in a substantial reduction in the computing time required. 

Naive discretizations of the differential equations typically either result 
in very smeared-out representations of shocks, or else produce oscillations 
near discontinuities. This is because a finite-difference approximation to a 
derivative simply isn’t appropriate when the function is discontinuous. Fig- 
ure 1.1 shows some sample calculations for a “shock tube” problem discussed 
in Sect. 2.6. Figure 1.1(a) shows the smeared result from the first-order 
Godunov method introduced in Sect. 4.5. Figure 1.1(b) shows the results if 
we instead use a Lax-Wendroff style second-order method, which should be 
a superior method if the solution being computed were smooth, but here 
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R.J. LeVeque 




Fig. 1.1. Solution to the Sod problem for the Euler equations, as computed with 
three numerical methods, (a) Godunov’s method, (b) A Lax-WendrofF style second- 
order method, (c) A high-resolution method with the superbee limiter 



produces a very oscillatory approximation to this discontinuous solution. A 
variety of high-resolution numerical methods have been developed over the 
past several decades that resolve discontinuities sharply and yet produce at 
least second-order accuracy in smooth regions of the flow. Figure 1.1(c) shows 
the same calculation with a high-resolution method. 

The main goal of these notes is to introduce the ideas used in this type of 
method. Understanding these numerical methods requires a good foundation 
in the mathematical theory of hyperbolic conservation laws, which is also 
reviewed here, together with a brief survey of some fundamental concepts 
from numerical analysis. We will then take a look at some complications that 
arise in the more complex equations important in astrophysical applications. 

In such a short course it is not possible to fully develop all the ideas 
discussed. Instead my goal is to present an overview of main ideas and ap- 
proaches, including computational difficulties as well as successes, in a frame- 
work that I hope will serve as a roadmap to students or researchers new to 
this area. Most of the topics covered here are discussed extensively in the 
literature, often in textbooks as well as research papers. I will try to give 
enough pointers that the reader can at least get started tracking down the 
details on any topic of interest. I have not, however, attempted to provide a 
complete bibliography to any of the subjects discussed, and there are many 
approaches and interesting works which are not cited or surveyed here. 

I am not an astrophysicist, but rather an applied mathematician and nu- 
merical analyst with an interest in several applications where very similar 
mathematical equations and computational problems arise from the study of 
seemingly diverse phenomena. The equations discussed here are used directly 
in the study of aerodynamics, gas turbines, internal combustion engines, fu- 
sion reactors, and a host of other computational fluid dynamics applications. 
Moreover, the mathematical structure of hyperbolic conservation laws arises 
in many other fields including elasticity, flow in porous media, oceanography, 
and atmospheric sciences. Studying the literature in these other fields and in 
the applied mathematics community can be valuable in understanding many 
of the mathematical subtleties of these equations. Numerical methods have 
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been developed in all of these areas and often good algorithms can be adapted 
to problems in other fields. 

I have little direct experience in computational astrophysics, and am very 
grateful to the organizers of this course for giving me the opportunity to learn 
more about this fascinating application and some of the techniques developed 
in this field. In this learning process I have benefit ted enormously from the 
other lectures in this course, conversations with many of the participants, 
and also from the help of countless other people who have generously shared 
their knowledge with me and in some cases commented on earlier drafts of 
these notes. In particular I would like to mention Derek Bale, Bruce Balick, 
Dinshaw Balsara, Jerry Brackbill, Moysey Brio, Phil Colella, Ewald Muller, 
Philippos Papadopoulos, Ken Powell, Ed Seidel, Michael Shearer, Phil Roe, 
Dongsu Ryu, Oskar Steiner, Bram van Leer, and Rolf Walder. Of course any 
errors that remain in these notes are my own responsibility. 

My own work on numerical methods for conservation laws discussed here 
has been supported primarily by the National Science Foundation, the De- 
partment of Energy, and the National Center for Atmospheric Research. 

1.1 Software 

Developing computer models of complex phenomena generally requires de- 
veloping your own computer programs. Rarely will you find software that 
easily solves the full problem you are studying. You may be able to obtain a 
research code from another group studying a similar problem, but such codes 
should be viewed with suspicion until you have convinced yourself that the 
methods implemented are appropriate for your problem. It may also turn 
out to be more trouble converting someone else’s code to your needs than it 
would be to write one from scratch, particularly if it is poorly documented 
and not written with such extensions in mind (as is true of most research 
codes). 

On the other hand, there exists by now a large quantity of excellent soft- 
ware available for solving many standard mathematical problems of the sort 
that often arise as part of a larger simulation. Much of this software is freely 
available on the Web.^ When it is possible to make use of such software, it is 
often much more efficient and reliable than anything you can write yourself. 
This is particularly true in areas such as linear algebra or ordinary differential 
equations, where a very general framework covers a huge number of practical 
problems. Linear systems of the form Ax = b arise everywhere, and enor- 
mous effort has been put into developing optimal software for solving such 
systems in a variety of contexts: direct methods such as Gaussian elimination 
for dense systems, faster methods for symmetric or banded systems, and a 
wide variety of iterative methods for sparse systems. For the general initial 
value problem of ODEs, y'{t) = f{q^t) with y{to) given, numerous packages 

^ A few links to software sources can be found from my homepage. 
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are available that automatically select methods and adaptively choose time 
steps to efficiently solve the equation to a desired tolerance. Other packages 
are available for two-point boundary- value problems. 

Partial differential equations are more difficult to handle with general soft- 
ware. There are many different types of equations, exhibiting diverse behavior 
and requiring quite different methods for their solution. Most practical prob- 
lems are also posed in multi- dimensional regions with complicated boundaries 
and boundary conditions, further complicating efforts to develop general soft- 
ware. Nonetheless, more and more software is appearing for particular classes 
of problems, with elliptic equations probably receiving the best coverage to 
date. 

For hyperbolic equations and conservation laws, a number of packages 
have appeared in the past few years. Much of my own effort recently has 
gone into the clawpack (Conservation LAWs PACKage) software [138] and 
the adaptive mesh refinement version AMRCLAW[29j. This software is aimed 
at providing a unified framework for solving a wide variety of conservation 
laws, and hyperbolic systems more generally, using high-resolution methods 
of the type discussed in these notes. 

The CLAWPACK software was initially developed as a teaching tool, and 
studying the fortran code in conjunction with reading the brief descriptions of 
methods presented here may be valuable in understanding how these methods 
are implemented. In addition, many sample programs using this software are 
available with the package, or available elsewhere, and experimenting with 
these can help develop intuition both about the physics and the behavior 
of numerical methods. In particular, almost all of the sample computational 
results presented in these notes were produced with this package and can be 
easily reproduced. 

1.2 Notation 

In general I will use q{x^t) to denote a generic function we are attempting 
to compute with some numerical method. This will typically be a vector- 
valued function. For example, in the Euler equations we have conservation of 
mass, momentum, and energy and these are the components of q. Often u is 
used instead of q in the PDE and numerical analysis literature, but it is very 
convenient to reserve u for the a:-component of velocity. 

Typically vectors will not be denoted by boldface or arrows, but can be 
recognized as such from the context. In the sections on MHD and relativistic 
flow, boldface is used. 

Components of vectors will be denoted either by subscripts or super- 
scripts, and often the notation changes from one section to another, depend- 
ing on which is more convenient. In particular, when numerical methods are 
discussed a subscript is used for the index indicating spatial position, and 
then superscripts are often used for vector components. Only in the sections 
on relativity does position follow tensor notation. 
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1.3 Classification of Differential Equations 



First we review the classification of differential equations into elliptic, para- 
bolic, and hyperbolic equations. Not all PDF’s fall into one of these classes, 
by any means, but many important equations that arise in practice do. These 
classes of equations model different sorts of phenomena, display different be- 
havior, and require different numerical techniques for their solution. Standard 
texts on partial differential equations such as Kevorkian [126] give further dis- 
cussion. 

1.3.1 Second- Order Equations. In most elementary texts the classifica- 
tion is given for a linear second-order differential equation in two independent 
variables of the form 



o^Qxx + hqxy -h cqyy -h dqx eqy fq = g . 



The classification depends on the sign of the discriminant. 







<0 


==> 


elliptic 


= 0 




parabolic 


>0 




hyperbolic 



and the names arise by anology with conic sections. The canonical example 
are the Poisson problem qxx Qyy =9 for an elliptic problem, the heat 
equation qt = qxx for a parabolic problem, and the wave equation qtt = qxx 
for a hyperbolic problem. In the parabolic and hyperbolic case t is used 
instead of y since these are typically time-dependent problems. These can 
all be extended to more space dimensions. These equations describe different 
types of phenomena and require different techniques for their solution (both 
analytically and numerically) , and so it is convenient to have names for classes 
of equations exhibiting the same general features. There are other equations 
that have some of the same features and the classification scheme can be 
extended beyond the second-order linear form given above. Some hint of this 
is given in the next few sections. 

In particular, these notes are primarily concerned with hyperbolic equa- 
tions that are first-order differential equations (involving only first derivatives 
with respect to space and time), and which generally are nonlinear. This form 
is described in Sect. 1.3.4. 



1.3.2 Elliptic Equations. The classic example of an elliptic equation is the 
Poisson problem 

y^q = f, ( 1 . 1 ) 

where is the Laplacian operator and / is a given function of x in some spa- 
tial domain i7. We seek a function q{x) in f2 satisfying (1.1) together with 
some boundary conditions all along the boundary of i7. Elliptic equa- 
tions typically model steady-state or equilibrium phenomena, and so there is 
no temporal dependence. Elliptic equations may also arise in solving time- 
dependent problems if we are modeling some phenomena that are always in 
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local equilibrium and equilibrate on time scales that are much faster than 
the time scale being modeled. For example, in “incompressible” flow the fast 
acoustic waves are not modeled and instead the pressure is computed by 
solving a Poisson problem at each time step which models the global effect 
of these waves. This is described in Sect. 8.6.1. Similar elliptic equations are 
solved in MHD to keep the magnetic field divergence-free at each time (see 
Sect. 8.6). In a self-gravitating flow the gravitational potential at each time 
is typically determined by solving a Poisson problem where the right-hand 
side depends on the density distribution at each time. 

Elliptic equations give boundary- value problems (BVP’s) where the solu- 
tion at all points must be simultaneously determined based on the boundary 
conditions all around the domain. This typically leads to a very large sparse 
sytem of linear equations to be solved for the values of Q at each grid point. 
If an elliptic equation must be solved in every time step of a time-dependent 
calculation, as in the examples above, then it is crucial that these systems 
be solved as efficiently as possible. Very sophisticated iterative methods such 
as preconditioned conjugate gradient methods and multigrid methods have 
been developed for solving sparse linear systems arising from elliptic equa- 
tions. There is a huge literature on these methods; see [41], [101], [244], [255] 
for general discussions of such methods and [171], [172] for some astrophys- 
ical applications. 

More generally, a linear elliptic equation has the form 

Lq = f, (1.2) 

where L is some elliptic operator. This notion will not be discussed fur- 
ther here, but the idea is that mathematical conditions are required on the 
differential operator L which insure that the boundary value problem has a 
unique solution. 

1.3.3 Parabolic Equations. If L is an elliptic operator then the time- 
dependent equation 

qt ^ Lq- f (1.3) 

is called parabolic. If L = is the Laplacian, then (1.3) is known as the 
heat equation or diffusion equation and models the diffusion of heat in 
a material, for example. 

Now q{x, t) varies with time and we require initial data g(x, 0) for every 
X G as well as boundary conditions around the boundary at each time 
i > 0. If the boundary conditions are independent of time, then we might 
expect the heat distribution to reach a steady state in which q is independent 
of t. We could then solve for the steady state directly by setting == 0 in 
(1.3), which results in the elliptic equation (1.2). 

Marching to steady state by solving the time-dependent equation (1.3) 
numerically would be one approach to solving the elliptic equation (1.2), but 
this is typically not the fastest method if all we require is the steady state. 
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1.3.4 Hyperbolic Equations. My lectures will primarily concern first- 
order hyperbolic systems of m equations. The linear problem in one space 
dimension has the form 

qt + Aqa:=0, (1.4) 

where q{x, t) G and A is an m x m matrix. The problem is called hy- 
perbolic if A has real eigenvalues and is diagonalizable, i.e., has a complete 
set of linearly independent eigenvectors. As we will see in Sect. 3.2, these 
conditions allow us to view the solution in terms of propagating waves, and 
indeed hyperbolic systems typically arise from physical processes that give 
wave motion or advective transport. 

The simplest example of a hyperbolic equation is the constant-coefficient 
advection equation 

Qt~\-uqx = 0 , (1,5) 

where u is the advection velocity. The solution is simply g(x, t) = q{x-ut^ 0), 
so any q profile simply advects with the flow at velocity u. 

As a simple example of a linear system, the equations of linear acoustics 
in a 7 -law gas can be written as a first-order system of two equations in one 
space dimension as 



’ p 


1 


0 7 P 0 


P 


u 


-T^ 

t 


. 1/qo 0 


u 



in terms of pressure and velocity perturbations, where po and ^re the back- 
ground pressure and density. Note that if we differentiate the first equation 
with respect to t, the second with respect to x, and then eliminate Uxt = utx 
we obtain the second-order wave equation for the pressure: 

Ptt = C^Pxx , 
where 

c = ^/7P/0 

is the speed of sound in the gas. 

We will also study nonlinear hyperbolic systems in depth, where shock 
waves arise in the solution. In one space dimension we will see (Sect. 2.) that 
physical conservation laws typically lead to systems of the form 

Qt + f(q)x = 0 , (1.7) 

where f{q)x — ^{f{q)) and f{q) is the flux function, as discussed in Sect. 2. 
This nonlinear system is called hyperbolic if the Jacobian matrix f\q) with 
{i,j) element dfi/dqj is diagonalizable with real eigenvalues for each value 
q of physical interest. Linearizing a nonlinear hyperbolic system about some 
state qo then gives a linear hyperbolic equation qt-^f'{qo)qx = 0 where q is the 
perturbation from go- Linear acoustics, for example, arises from linearizing 
the Euler equations of compressible flow given in Sect. 2. 
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2. Derivation of Conservation Laws 

We now turn to the derivation and mathematical analysis of hyperbolic sys- 
tems of conservation laws. This theory is needed to understand both the 
behavior of solutions of nonlinear conservation laws and the development of 
modern numerical methods for their solution. 

This section of these notes is largely a condensed version of my ETH 
Notes [141], and is intended to be an overview of the important points for 
readers who are already familiar with this theory. More details can be found 
in [141] for those who are not. Numerous other books are also available on 
this theory, including [60], [97], [136], [221], [256]. 

Conservation laws are time-dependent systems of partial differential equa- 
tions (usually nonlinear) with a particularly simple structure. In one space 
dimension the equations take the form 

d ^ 

^ 9 ( 2 ;, t) + t)) = 0. (2.1) 

Here q{xyt) G is an m-dimensional vector of conserved quantities, or 
state variables, such as mass, momentum, and energy in a fluid dynamics 
problem. More properly, qj is the density function for the jth state variable, 
with the interpretation that qji^^t) dx is the total quantity of this state 
variable in the interval [xi,X2] at time t. 

The fact that these state variables are conserved means that 
should be constant with respect to t. The functions qj themselves, represent- 
ing the spatial distribution of the state variables at time t, will generally 
change as time evolves. The main assumption underlying (2.1) is that know- 
ing the value of q{x,t) at a given point and time allows us to determine 
the rate of flow, or flux, of each state variable at (x,i). The flux of the jth 
component is given by some function fj{q{x^t)). The vector- valued function 
f{q) with jth component fj{q) is called the flux function for the system of 
conservation laws. 

To see how conservation laws arise from physical principles, we will begin 
by deriving the equation for conservation of mass in a one-dimensional gas 
dynamics problem, for example flow in a tube where properties of the gas such 
as density and velocity are assumed to be constant across each cross section 
of the tube. Let x represent the distance along the tube and let g{x,t) be 
the density of the gas at point x and time t. This density is defined in such 
a way that the total mass of gas in any given section from xi to X 2 , say, is 
given by the integral of the density: 

rX2 

mass in [x\^X2] at time dx . (2.2) 

J XI 

If we assume that the walls of the tube are impermeable and that mass is 
neither created nor destroyed, then the mass in this one section can change 
only because of gas flowing across the endpoints x\ or X 2 - 
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Now let u(x, t) be the velocity of the ga^ at the point x at time t. Then 
the rate of flow, or flux of gas past this point is given by 



mass flux at {x,t) = g{x,t)u{x^t) . 



(2.3) 



By our comments above, the rate of change of mass in [xi,X 2 ] is given by the 
difference in fluxes at x\ and X 2 - 

f-X2 

(2.4) 



d 

— J Q{x,t)dx = g{xi,t)u{xi,t) - g{x 2 ,t)u{x 2 ,t) . 



This is one integral form of the conservation law. Another form is obtained 
by integrating this in time from ti to t 2 ^ giving an expression for the mass 
in [x\^X 2 ] at time t 2 > t\ in terms of the mass at time t\ and the total 
(integrated) flux at each boundary during this time period: 

pX2 pX2 

/ g{x,t 2 )dx = / g{x,ti)dx (2.5) 

Jxi J X\ 

t2 



Pl2 Pl2 

+ / g{xiyt)u{xi^t)dt - g{x2^t)u{x2,t)dt . 

Jti Jti 



To derive the differential form of the conservation law, we must now as- 
sume that g{x, t) and u{x, t) are differentiable functions. Then using 

g{x,t 2 ) - g{x,ti) = ^^g{x,t)dt (2.6) 

and 

fXi Q 

g{x 2 ,t)u{x 2 ,t) - g{xi,t)u{xi,t) ^ J —{g{x,t)u{x,t))dx (2.7) 

in (2.5) gives 

pt2 r g g >| 

J J = 0 ' (2-8) 

Since this must hold for any section [x\^X 2 ] and over any time interval [^ 1 ,^ 2 ], 
we conclude that in fact the integrand in (2.8) must be identically zero, i.e.. 



Qt + (^^)z = 0 conservation of mass . 



(2.9) 



This is the desired differential form of the conservation law for the conser- 
vation of mass, which is often called the continuity equation. 

Exactly the same derivation applies more generally if we replace g by the 
density of any conserved quantity, and let f{g) be the corresponding flux 
function describing the flow of this quantity. 

The simplest example of a conservation law consists of the equation (2.9) 
alone in the case where u is constant. This models flow of an incompress- 
ible fluid in a one-dimensional pipe at constant velocity. In this case g would 
model not the density of the fluid but the concentration of something be- 
ing transported with the fluid (a chemical, or heat perhaps). If we relabel 
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this concentration as g, then we have the simple linear scalar advection 
equation, 

qt + — 0 . ( 2 . 10 ) 

2.1 The Euler Equations of Gas Dynamics 

In gas dynamics, the equation (2.9) must be solved in conjunction with equa- 
tions for the conservation of momentum and energy: 

{Qu)i -h {qv?' •fp)x = 0 conservation of momentum , ( 2 . 11 ) 

Et H- {u[E -l-p))^, = 0 conservation of energy . (2.12) 

The resulting system of three conservation laws gives the Euler equations 
of gas dynamics. Note that these equations involve another quantity, the 
pressure p, which must be specified as a given function of pu, and E in 
order that the fluxes be well defined functions of the conserved quantities 
alone. This additional equation is called the equation of state and depends 
on physical properties of the gas under study. We will consider the simplest 
case of a poly tropic gas: 

E = + \qu'^ , (2.13) 

where 7 is the ratio of specific heats. From statistical mechanics it can be 
shown that 7 =: (a -(- 2 )/a, where a is the number of internal degrees of 
freedom of the gas molecules. For a monatomic gas a = 3 and 7 = 5/3, which 
is frequently the proper value for astrophysical calculations. In aerodynamics 
7 = 1.4 = 7/5 is generally used since air consists primarily of diatomic 
molecules. 

If we introduce the vector ^ 6 IR^ as 





g{x,t) 


' q\ ' 




11 


Q{x^t)u{x^t) ~z 


= Q2 


(2.14) 




E{x, t) 


_ ^3 _ 





then the system of equations (2.9), (2.11), (2.12) can be written simply as 
Qi + f{q)x = 0 , (2.15) 

where 



r 1 1 


92 




II 

to 

+ 

II 


92/9i+p(<?) 


(2.16) 


lu{E-\-p) ] 1 


. 92(93 +p(9))/9i 





Again, the form (2.15) is the differential form of the conservation laws, which 
holds in the usual sense only where u is smooth. More generally, the integral 
form for a system of m equations says that 
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d 

— J q{x, t) dx = f(q{xi ,t)) - f{q{x 2 , t)) 
for all xi, X 2 , t. Equivalently, integrating from ti to t 2 gives 

pX2 PX2 

I q{x,t 2 )dx = / q{x,ti)dx 

J X\ J X\ 

+ / f{.q{xi,t))dt- [ f{q(x 2 ,t))dt 

Jti Jti 



(2.17) 



(2.18) 



for ail xi, X 2 , ti, and ^ 2 - These integral forms of the conservation law are 
the fundamental physical conservation laws. It is important to note that 
these equations make sense even if the function q{x,t) is discontinuous, while 
transforming to the differential form (2.15) is valid only when q is smooth. 



2.2 Dissipative Fluxes 

So far we have ignored dissipative effects such as molecular diffusion, heat 
conduction or fluid viscosity. These effects generally lead to fluxes that depend 
on the spatial gradient of the state variable q rather than on the value of q. 

As a simple example, consider a fluid that is flowing through a pipe at 
constant speed u and let q represent the concentration of some chemical, or 
heat, in the fluid. According to Fourier’s law of heat conduction, q diffuses at 
a rate -(xqx which is proportional to the derivative of g, as well as advecting 
with the flow. Then the flux function is 

/(?, Qx) =uq- nq^ 

and we obtain the advection-diflfusion equation 

qt + '^qx = p^qxx • (2.19) 

If u = 0 then this is simply the heat equation. 

Similarly, adding fluid viscosity and heat conduction to the Euler equa- 
tions gives the full Navier-Stokes equations, which include terms that 
depend on the second derivative of the velocity (in the case of viscosity) and 
the second derivative of the temperature (for heat conduction). 

Adding these dissipative fluxes causes problems numerically because the 
equations are now parabolic rather than hyperbolic. This typically causes 
severe time step restrictions as discussed in Sect. 1.3.3 and requires the use 
of implicit methods for efficient solution. 

2.3 Source Terms 

In deriving the conservation law (2.15) we assumed that the “total mass” of 
q over an arbitrary interval [xi, X 2 ], given by the integral (2.2), should change 
only due to fluxes through the endpoints of this interval. In many cases this 
is not true. There may be some creation or destruction (source or sink) of 
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the “conserved quantity” within this interval. This leads to the introduction 

of source terms in the equation. Here are a few examples of the sorts of 

effects that lead to source terms: 

- Reacting flow or ionization, in which case we have a separate continuity 
equation for the mass of each species. The equations model the advection 
with the flow, as in (2.9), but also include source terms for the chemical 
kinetics, corresponding to the fact that each species is not individually 
conserved. 

- External forces such as gravity, which lead to a source term in the mo- 
mentum and energy equations. (In astrophysics one must often consider 
self-gravitation as well, in which case the gravitational force must be com- 
puted from the density function and changes with time.) 

~ Radiative heat transfer, which transfers energy on much faster time scales 
than the fluid dynamics being modeled and results in source terms in the 
energy equation. 

- Geometric source terms, which arise when a problem in several space 
dimensions is reduced to a lower-dimensional problem using spherical or 
cylindrical symmetry, for example. 



To see how source terms come into the conservation law, let’s reconsider 
the expression (2.17) in the situation where there is another source of q affect- 
ing t) dx other than fluxes at xi and 2 : 2 . This source will be denoted 

in general by ^(^), with the idea that 'ip{q{x,t)) represents the increase in q 
(or decrease, 'ip < 0) at the point (x, t) per unit time, due to effects of the 
sort described above. Then in place of (2.4) we have 



_d 

d^ 




dx = Fi {t) — 



F 2 {t) + f 'ip{q{x,t))dx . 
J a:i 



( 2 . 20 ) 



Note that the source term must be integrated over the interval since there is 
an effect at each point in the interval. 

Proceeding as before, assuming q, /, and ip are smooth, we will obtain 
the differential equation 



qt + f{q). = m ■ ( 2 . 21 ) 

This is a nonhomogeneous version of the conservation laws considered thus 
far. Note that if there are no spatial variations, g(x,t) = q{t), then the equa- 
tion (2.21) reduces to the ordinary differential equation 

qt - i’iq) . ( 2 . 22 ) 



2.4 Radiative Transfer and Isothermal Equations 

If we wish to include radiative heat transfer in a calculation then we must 
include source terms corresponding to the energy radiated or absorbed at 
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each point. (See [166], [217], and the lectures of Dorfi and Mihalas in this 
volume for a complete discussion of radiative transfer.) Typically this is very 
complicated because we must model the fact that radiation from all points 
in the computational domain may be absorbed at each point x, and so the 
source term at x involves an integral over the entire domain. Such problems 
can presumably be handled by the techniques discussed in Sect. 5., but this 
may be computationally expensive due to the need to approximate these 
integrals in each time step. 

A simple special case arises if we assume that radiation is not reabsorbed 
within the computational domain (the optically thin limit). This is a rea~ 
sonable model in studying the dynamics of an interstellar molecular cloud, 
for example. Since molecular motion radiates heat very effectively, such a 
cloud may have a temperature of only a few degrees Kelvin, and any heat 
generated by a shock wave passing through the cloud will be rapidly radiated 
away. In this case it may be appropriate to use a source term in the energy 
equation which simply drives the temperature exponentially quickly towards 
some constant value To on time scale r (Newton’s law of cooling). Then the 
one-dimensional Euler equations become 

Qt + {qu)x = 0 , 

{Qu)t + {QU^ +p)x = 0, (2.23) 

Et + u{E + p)x = -q{T - To)/t . 

If the radiation time scale r is very small relative to the dynamic time 
scales that we hope to model numerically, then these source terms will be 
very “stiff”, as discussed in Sect. 5.4. An alternative in this case would be 
to use the isothermal Euler equations, which is obtained by replacing the 
ideal gas equation of state (2.13) by the simple equation of state that results 
from assuming that the temperature is in fact constant everywhere. Since 
T = p/TZq, this results in the equation of state 

p = TZqTq = a^Q , (2.24) 

where TZ is the gas constant. Then conservation of mass and momentum gives 
the system of two isothermal equations 

Qt + = 0 , (2.25) 

{^)t + + “^^) = 0 ■ (2-26) 

In this case energy is not conserved due to radiation, the equation of state 
does not involve E, and we do not need an energy equation since the isother- 
mal assumption allows us to compute the energy directly from q and qu if 
desired. 

The full Euler equations are sometimes used to simulate isothermal sit- 
uations by setting 7 to a value slightly larger than 1 , say 7 = 1.001. This 
corresponds to taking the number of internal degrees of freedom a in the 
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gas to be very large (see Sect, 2,1). Then if the gas is heated by a shock, for 
example, this heat will be spread around among so many degrees of freedom 
that it has little impact on the pressure or temperature, so that isothermal 
behavior results provided the initial data has constant temperature. 



2.5 Multi-dimensional Conservation Laws 



Conservation laws generalize naturally to more than one space dimension. In 
three dimensions, the equation takes the form 



qt + f{q)x + g{q)y + Kq)^ = o , 



(2.27) 



where g{q) and h{q) give the fluxes in the y- and z-directions respectively 
for a given state q. This can again be derived from the more fundamental 
integral form. Let R = [xi^ X 2 ] x [^ 1 , ^ 2 ] x [ 2 ^ 1 , 2 ^ 2 ] be a cube in 3-space aligned 
with the coordinate directions. Then 



IIL q{x,t2)dx - IIL q{x, ti) dx = 

ri 2 pZ 2 rV 2 

/ / / f{q{xi,y,z,t)dydzdt 

J ti J z\ J y\ 

pi2 PZ2 PV2 

- / / / f{q{x2,y,z,t)dydzdt 

J z\ Jyi 

pt2 pZ2 pX2 

+ / / f{q{x,yi,z,t)dxdzdt 

J t\ J Z\ J X\ 

pt2 PZ2 PX2 

- / / f{q{x,y2,z,t)dxdzdt 

J t\ J Z\ J X\ 



(2.28) 



+ 



pt 2 pV 2 P ^2 

/ / / f{q(x,y,zi,t)dxdydt 

Jti Jyi Jxi 



pt 2 py 2 PX 2 

- / / / f{q{x,y,Z2,t)dxdydt 

Jti Jyi Jxi 



and manipulations similar to those performed above lead to the differential 
equation (2.27) when q is sufficiently smooth. 

We do not need to use a cube in this construction. More generally let i? 
represent any volume in 3-space. Then the (outward) flux through the surface 
of Q at any point x — {x^y^z) is given by J^{q{x,t)) • n where T — (/,^,/i) 
and n is the (outward) normal to the surface of 17 at x. The total net outward 
flux can then be computed by integrating this flux over the surface 9l7, so 

[[[ q{x,t 2 )dx- [ f [ q{x,ti)dx ^ - f f T{q)'ndA. (2.29) 
J J J Q J J J Q J JdQ 

By Gauss’s theorem, when q is smooth we can rewrite this as 

IIL q{x,t 2 )dx - III , (2.30) 
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where 



V • T{q) = f{q)x + 9(q)y + h(q)x . 


(2.31) 


This leads to 






(2.32) 



and hence again to (2.27) since i7, and t 2 are arbitrary. 

The Euler equations in three space dimensions take the form 



Qt + {qu)x + (ev)y + {gw)x = 0 , 

{gu)t + {qu^ + p)x + {Quv)y + {quw)x = 0 , 

{gv)t + {quv)x + (qv^ + p)y + igvw)z = 0 , 
(gw)t + {guw)x + {gvw)y + {gw^ +p)^ = 0 , 

Et + {u{E + p))x + {v{E + p))y + {w{E + p))x = 0. 



(2.33) 



Note that the forces that affect the momentum come from the stress tensor 



T = 



QU^ -j-p 
guv 

QUW 



guv 

gv^ +P 

gvw 



guw 

gvw 

gw^ T p 



~ guu + pi , 



(2.34) 



where uu — n (g) xx represents the tensor product of these vectors. 



2.6 The Shock Tube Problem 

A simple example that illustrates the interesting behavior of solutions to 
conservation laws is the “shock tube problem” of gas dynamics. The physical 
set-up is a tube filled with gas, initially divided by a membrane into two 
sections. The gas has a higher density and pressure in one half of the tube 
than in the other half, with zero velocity everywhere. At time t ~ 0, the 
membrane is suddenly removed or broken, and the gas allowed to flow. We 
expect a net motion in the direction of lower pressure. Assuming the flow 
is uniform across the tube, there is variation in only one direction and the 
one-dimensional Euler equations apply. 

The structure of this flow turns out to be very interesting, involving three 
distinct waves separating regions in which the state variables are constant. 
Across two of these waves there are discontinuities in some of the state vari- 
ables. A shock wave propagates into the region of lower pressure, across 
which the density and pressure jump to higher values and all of the state 
variables are discontinuous. This is followed by a contact discontinuity, 
across which the density is again discontinuous but the velocity and pressure 
are constant. The third wave moves in the opposite direction and has a very 
different structure: all of the state variables are continuous and there is a 
smooth transition. This wave is called a rarefaction wave since the density 
of the gas decreases (the gas is rarefied) as this wave passes through. 
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The interface between the two initial gases moves along the contact dis- 
continuity, hence the name. Figure 2,1 shows particle paths in the x-t plane 
for one example with gi = pi — S and = Pr = 1* This particular problem 
is called the Sod problem since Sod [225] used it as a test in an influential 
early comparison of different numerical methods. With this data the temper- 
ature T, which is proportional to p/g^ is initially the same everywhere but 
the gas to the left has a higher density and pressure. 



pressure density 





velocity particle paths in x-t plane 





Fig. 2.1. Solution to Sod’s shock tube problem for the Euler equations 



One plot in Fig. 2.1 shows some particle paths in the x-t plane. Notice 
that the gas is accelerated abruptly across the shock and smoothly through 
the rarefaction. At any fixed t the particles shown are spaced proportional 
to l/g (the specific volume), so wider spacing indicates lower density. Note 
that there is a jump in density across the contact discontinuity while the 
fluid velocity is the same on both sides, and equal to the speed at which this 
discontinuity propagates, so particles never cross this contact surface. Note 
also the decrease in density cls particles go through the rarefaction wave but 
the compression across the shock. 

If we put the initial discontinuity at x = 0, then the resulting solution 
q{x,t) is a “similarity solution” in the variable x/^, meaning that q{x,t) can 
be expressed as a function of x/t alone, say q{x^t) = u;(x/t). It follows that 
g(x, t) — q{ax^at) for any a > 0, so the solution at two different times t and 
at look the same if we rescale the x-axis. This also means that the waves 
move at constant speed and the solution q{x,t) is constant along any ray 
x/t — constant in the x-t plane. Figure 2.1 shows the solution as a function 
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of X at time t = but also shows the solution at any other time t > 0 by 
simply rescaling the x-coordinate by multiplying by t. 

In a real experimental shock tube, the state variables would not be dis- 
continuous across the shock wave or contact discontinuity because of effects 
such as viscosity and heat conduction. These are ignored in the Euler equa- 
tions. If we include these effects, using the full Navier-Stokes equations, then 
the solution of the partial differential equations would also be smooth. How- 
ever, these smooth solutions would be nearly discontinuous, in the sense that 
the rise in density would occur over a distance that is microscopic compared 
to the natural length scale of the shock tube. If we plotted the smooth solu- 
tions they would look indistinguishable from the discontinuous plots shown in 
Fig. 2.1. For this reason we would like to ignore these viscous terms altogether 
and work with the simpler Euler equations. 

The shock tube problem is a special case of what is known mathematically 
cLS the Riemann problem. In general, the Riemann problem consists of the 
conservation law qt -f f{q)x = 0 together with the special initial data 



q{x,0) 



qi if X < 0 

qr if X > 0 . 



(2.35) 



This data consists of two constant states separated by a discontinuity. It 
turns out that this problem is relatively easy to solve in general, and that the 
resulting wave structure reveals a lot about the structure of solutions more 
generally. Many numerical methods are based on solving Riemann problems, 
including the methods discussed below. 

Figures 2.2 and 2.3 show two more examples of solutions to Riemann 
problems for the Euler equations. The initial data chosen in these figures 
gives an indication of structures typically seen in certain calculations of stellar 
atmosphere dynamics. See Sect. 3.4 for details on how the solutions to these 
Riemann problems are computed. 

Figure 2.2 shows a shock tube problem with data u = 0, gi = ^ 

and a pressure jump from pi = 10 to Pr = 1. In this case the gas to the left 
has a higher temperature T = p/Tlg, as might occur if this gas has just been 
burned in the initiation of an explosion, for example. The higher pressure on 
the left leads to an expansion of this gas so that the rarefaction wave reduces 
the density to a lower value than gi = gr- This expansion requires the gas to 
move to the right, creating a shock wave that moves into the initially cooler 
gas, compressing it and raising its density to a value substantially higher than 
gi = g^. Across the contact discontinuity there is a large jump in density, and 
still a jump in temperature between the hot and cool gases, but the pressures 
are now equal. 

The high-density region between the shock and contact consists of initially- 
cool gas that has been “swept up” by the expanding hot gas. Structures of this 
form arise in supernova explosions, for example. An explosion might be mod- 
eled at the simplest level by a “spherical Riemann problem” with piecewise 
constant data in three dimensions having one constant state outside a sphere 
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pressure 




velocity 




density 




particle paths in x-t plane 




Fig. 2.2. Solution to a Riemann problem for the Euler equations where the gas to 
the left is at a higher temperature 



of some radius Rq and a different constant state, with much higher tempera- 
ture, inside this sphere. For short times the solution along any ray from the 
origin will look roughly like what is seen in Fig. 2.2, with a spherical shell of 
high-density gas moving outward. (See Fig. 6.1 for a multi-dimensional calcu- 
lation.) Three dimensional effects will lead to an attenuation of this structure 
and more complicated phenomena as the rarefaction wave reflects at the ori- 
gin. In modeling a supernova explosion one must also include gravitational 
forces so that the density should initially be decaying with r rather than con- 
stant, and of course a variety of other effects such as radiative transfer are 
crucial. Nonetheless, the same basic structure seen in Fig. 2.2 can be identi- 
fied in realistic simulations of supernovae, as seen for example in Fig. 3.10 of 
Muller’s lectures in this volume. 

Note that when gravitational forces are included, directed towards r = 0, 
this dense shell will be separated from lighter gas below by a contact surface. 
In a three-dimensional model this surface would initially be spherical but 
would be quickly broken up by Rayleigh-Taylor instabilities. This is discussed 
further in Section 3.4 of Muller’s lectures. See also Fig. 3.15 of his notes, where 
the stable shock and unstable contact surface are clearly seen (though here 
the instability is a results of energy deposition by neutrinos). 

Figure 2.3 shows the solution to a different Riemann problem where the 
initial data is 

Qi — I? '^i — 3, Pi 1 , 

2 , Uj- 1 , P>p 1 . 
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density 




particle paths In x-t plane 




- 2-1012 



Fig. 2.3. Solution to a Riemann problem for the Euler equations where the gas to 
the left has a higher velocity 

The gas on the left is moving faster than the gas on the right, and the 
collision produces two shock waves, one moving into each gas. As in the 
previous example, the shock moving to the right sweeps up material into a 
high-density layer. Again the spherical shell analog of this Riemann problem 
arises in astrophysics, in the formation of planetary nebulae, for example. An 
old star, in which the hydrogen and helium fuel has been exhausted, reaches 
a period of instability and ejects up to 80% of its mass in a so-called “slow 
wind”, characterized by velocities of about 10 km/sec. This is followed by a 
“fast wind” phase with velocities more on the order of 1000 km/sec. The fast 
wind forms a hot (10® K) invisible bubble which expands supersonically into 
the slow wind from inside, forming a thin dense shell of swept-up gas ahead 
of it. The emerging hot core of the central star ionizes the swept up shell and 
the undisturbed slow wind upstream of it. The ionized gas emits fluorescent 
and recombination-line radiation making the “planetary nebula” visible, see 
for example Fig. 2.4. The photo on the right shows NGC 6826, a relatively 
smooth and almost spherical nebula. The wind-compressed swept-up shell 
behind the outward moving shock is seen as the bright leading edge of the 
hot expanding bubble. 

In many Ccises the ejected slow wind is not at all spherically symmetric, 
leading to structures such as the Hourglass Nebula MyCn 18 shown on the 
left in Fig. 2.4. One possible mechanism for this is an asymmetric density 
distribution in the slow wind, allowing the wind-swept gas and the shock 
that precedes it to propagate faster in some directions than others. This is 
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Fig. 2.4. Photographs of planetary nebulae, visible evidence of dense swept-up 
shells generated by the stars which appear as bright dots in the center of each image. 
Observations by various teams using the WFPC2 camera on NASA’s Hubble Space 
Telescope, provided by B. Balick 



an active area of computational research. Some references include [76], [90], 
[113], [164]. 

A jet of gas intruding into an ambient medium also gives a structure of 
the form seen in Fig. 2.3 at the leading tip. Extragalactic jets are discussed 
by Muller, and their structure is investigated in Section 5.3 of his notes. See 
in particular Figs. 5.3 and 5.5. 

Similar shock structures arise from the head-on collision of stellar winds 
between the two stars of a binary system. In this case, the shocks approach a 
steady state location between the stars, at least in the coordinate frame co- 
rotating with the orbiting system. The orbital motion creates spiral-shaped 
shocks. Figure 2.5 shows the mass density for such a system over a cross- 
section through the orbital plane. This is a computational result from a three- 
dimensional calculation where adaptive mesh refinement has been used, as 
described in Sect. 6.3.2, in order to concentrate grid points where they are 
most needed near the shocks waves between the stars. The rectagular regions 
on the plot show regions of grid refinement. In this calculation two shocks 
are clearly visible. In general there is also a contact discontinuity, but in this 
particular calculation this is very weak. 

Mass is continually flowing out from the stars, through the shocks, and 
into the region between the shocks from which it must flow outwards in 
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Fig. 2.5. Adaptive mesh refinement calculation of colliding winds between binary 
stars. Density in the orbital plane from a full three-dimensional calculation. The 
rectangles show regions of refinement (see Sect. 6.3.2). Calculation by R. Walder 



directions roughly parallel to the shocks. This stationary structure can only 
exist in multidimensional problems and is coupled with strong transverse flow 
through the shocks. Any contact discontinuity would also be a slip line across 
which the two flows move parallel, typically at different speeds. This can lead 
to Kelvin-Helmholtz instabilities and roll-up of the slip lines. The thin region 
between the two shocks is also subject to the “thin shell instability” described 
for example in [230], [70], [251]. (See also Sect. 7.7.) See [177], [253] for more 
discussion of binary systems. 
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3. Mathematical Theory of Hyperbolic Systems 

3.1 Scalar Equations 

We begin our study of conservation laws by considering the scalar case. Many 
of the difficulties encountered with systems of equations are already encoun- 
tered here, and a good understanding of the scalar equation is required before 
proceeding. 

3.1.1 The Linear Advection Equation. We first consider the linear ad- 
vection equation, 

+ ugx = 0 . (3.1) 

The Cauchy problem is defined by this equation on the domain — oo < a: < 
oo, i > 0 together with initial conditions 

gf(a;,0) = qoix) . (3.2) 

The solution to this problem is simply 

q{Xj t) = qo{x - ut) (3.3) 

for > 0, as can be easily verified. As time evolves, the initial data simply 
propagates unchanged to the right (if u > 0) or left (if u < 0) with velocity 
u. The solution q{x,t) is constant along each of the rays x - ut = xq, which 
are known as the characteristics of the equation. 

Note that the characteristics are curves in the x-t plane satisfying the 
ordinary differential equations x^{t) = u, a:(0) = Xq- If we differentiate q{Xj t) 
along one of these curves to find the rate of change of q along the character- 
istic, we find that 

— g(x(f),0 = —q{x{t),t) + -^q{x{t),t)x'{t) 

— qt (2‘^) 

= 0 , 

confirming that q is constant along these characteristics. 

3.1.2 Burgers’ Equation, Now consider the nonlinear scalar equation 

Qt + f(q)x = 0 , (3.5) 

where f{q) is a nonlinear function of q. We will assume for the most part 
that f{q) is a convex function, f^'{q) > 0 for all q (or, equally well, / is 
concave with f^'(q) < 0 Vq). The convexity assumption corresponds to a 
“genuine nonlinearity” assumption for systems of equations that holds in 
many important cases, such as the Euler equations. The nonconvex case is 
also important in some applications (e.g., in MHD, see Sect. 8.) but is more 
complicated mathematically. 
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By far the most famous model problem in this field is Burgers’ equation, 
in which f{q) = so (3.5) becomes 

Qt+Qqx=0. (3.6) 

Actually this should be called the “inviscid Burgers’ equation”, since the 
equation originally studied by Burgers also includes a viscous term: 

qt + QQx = £Qxx ■ (3.7) 

This is about the simplest model that includes the nonlinear and viscous 
effects of fluid dynamics. 

Consider the inviscid equation (3.6) with smooth initial data. For small 
time, a solution can be constructed by following characteristics. Note that 
(3.6) looks like an advection equation, but with the advection velocity q 
equal to the value of the advected quantity. The characteristics satisfy 

x'{t) = q{x{t),t) (3.8) 

and along each characteristic q is constant, since 

^q{x{t),t) = ^q{x{t),t) + ^q{x{t),t)x'{t) 

= qt + qqx (3.9) 

= 0 . 

Moreover, since q is constant on each characteristic, the slope x^t) is constant 
by (3.8) and so the characteristics are straight lines, determined by the initial 
data. 

If the initial data is smooth, then this can be used to determine the 
solution q{x^t) for small enough t that characteristics do not cross: for each 
(x, t) we can solve the equation 

x = e + g(^,0)i (3.10) 

for ^ and then 

q{x,t) = q{^,0) . (3.11) 

3.1.3 Shock Formation. For larger t the equation (3.10) may not have a 
unique solution. This happens when the characteristics cross, as will even- 
tually happen if g^a:(a;,0) is negative at any point. At the time Tt where the 
characteristics first cross, the function g(x, t) has an infinite slope — the wave 
“breaks” and a shock forms. Beyond this point there is no classical solution of 
the PDE, and the weak solution we wish to determine becomes discontinuous. 

For times beyond the breaking time some of the characteristics have 
crossed and so there are points x where there are three characteristics leading 
back to t = 0. One can view the “solution” q at such a time as a triple- valued 
function. However, the density of a gas cannot possibly be triple valued at a 
point. We can determine the correct physical behavior by adopting the van- 
ishing viscosity approach. The equation (3.6) is a model of (3.7) valid only 
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for small e and smooth q. When it breaks down, we must return to (3.7). If the 
initial data is smooth and e very small, then before the wave begins to break 
the eqxx term is negligible compared to the other terms and the solutions to 
the two PDEs look nearly identical. However, as the wave begins to break, 
the second derivative term grows much faster than q^^ and at some point 
the eqxx term is comparable to the other terms and begins to play a role. 
This term keeps the solution smooth for all time, preventing the breakdown 
of solutions that occurs for the hyperbolic problem. This smooth solution has 
a steep transition zone where the viscous term is important. As e — 0 this 
zone becomes sharper and approaches the discontinuous solution known as 
a shock. It is this vanishing- viscosity solution that we hope to capture by 
solving the inviscid equation. 

3.1.4 Weak Solutions. A natural way to define a generalized solution of 
the inviscid equation that does not require differentiability is to go back to 
the integral form of the conservation law, and say that g(x, t) is a generalized 
solution if (2.5) is satisfied for all xi, X 2 , ^ 1 ,^ 2 - 

There is another approach that results in a different integral formulation 
that is often more convenient to work with. This is a mathematical technique 
that can be applied more generally to rewrite a differential equation in a form 
where less smoothness is required to define a “solution” . The basic idea is to 
take the PDE, multiply by a smooth “test function”, integrate one or more 
times over some domain, and then use integration by parts to move derivatives 
off the function q and onto the smooth test function. The result is an equation 
involving fewer derivatives on q, and hence requiring less smoothness. 

In our case we will use test functions € Cq (IR x IR) , Here Cq is the space 
of functions that are continuously differentiable with compact support. If we 
multiply qt fx = 0 by 0(x, t) and then integrate over space and time, we 
obtain 

nOO r-\-00 

/ / \(j)qt + (t>f{q)x\^xdt = (i . (3.12) 

Vo j —00 

Now integrate by parts, yielding 



[<l)tq -h 0x/(^)] dxdt = - / 0(x, 0)^(x, 0) dx 



Note that nearly all the boundary terms which normally arise through in- 
tegration by parts drop out due to the requirement that (f> have compact 
support, and hence vanishes at infinity. The remaining boundary term brings 
in the initial conditions of the PDE, which must still play a role in our weak 
formulation. 



Definition 3.1. The function q{x^t) is called a weak solution of the conser- 
vation law if (3.13) holds for all functions (/> G Cq (IR x IR) . 

The vanishing- viscosity generalized solution we defined above is a weak 
solution in the sense of (3.13), and so this definition includes the solution we 
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are looking for. Unfortunately, weak solutions are often not unique, and so an 
additional problem is to identify which weak solution is the physically correct 
vanishing-viscosity solution. Again, one would like to avoid working with the 
viscous equation directly, but it turns out that there are other conditions one 
can impose on weak solutions that are easier to check and will also pick out 
the correct solution. These are usually called entropy conditions by analogy 
with the gas dynamics case, where a discontinuity is physically realistic only 
if the entropy of the gas increases as it crosses the shock. 



3.1.5 The Riemann Problem. The conservation law together with piece- 
wise constant data having a single discontinuity is known as the Riemann 
problem. As an example, consider Burgers’ equation qt -h qq^ = 0 with piece- 
wise constant initial data 




if X < 0 
if X > 0 . 



(3.14) 



The form of the solution depends on the relation between qi and Cr- 



ease I. qi > 

In this case there is a unique weak solution. 




\{ X < st 
if X > st , 



where 



(3.15) 



s={qi+qr)l2 (3.16) 

is the shock speed, the speed at which the discontinuity travels, A general 
expression for the shock speed will be derived below. Note that characteristics 
in each of the regions where q is constant go into the shock (see Fig. 3,1) as 
time advances. 




Fig. 3.1. Shock wave 



Case II. qi < qr^ 

In this case there are infinitely many weak solutions. One of these is again 
(3.15), (3.16) in which the discontinuity propagates with speed s. Note that 
characteristics now go out of the shock (Fig. 3.2(a)) and that this solution is 
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X = st 




(b) 0 0 

Fig. 3,2. (a) Entropy- violating shock, (b) Rarefaction wave 



not stable to perturbations. If the data is smeared out slightly, or if a small 
amount of viscosity is added to the equation, the solution changes completely. 
Another weak solution is the rarefaction wave 

{ qi if X < qit 

xlt if qit < X < qrt (3-17) 

qr if X > qrt , 

This solution is stable to perturbations and is in fact the vanishing-viscosity 
generalized solution (Fig. 3.2(b)). 



3.1.6 Shock Speed. The propagating shock solution (3.15) is a weak solu- 
tion to Burgers’ equation only if the speed of propagation is given by (3.16). 
The same discontinuity propagating at a different speed would not be a weak 
solution. 

The speed of propagation can be determined by conservation. The relation 
between the shock speed s and the states qi and qr is called the Rankine- 
Hugoniot jump condition: 



fiqi) - f{qr) = s{qi - qr) ■ 



(3.18) 



For scalar problems this gives simply 

= /to) ~ /(g’-) ^ iZl 

qi ~ qr [g] ’ 



(3.19) 



where [•] indicates the jump in some quantity across the discontinuity. Note 
that any jump is allowed, provided the speed is related via (3.19). 

For systems of equations, qi — qr and f{qr) — f{Qi) are both vectors while 
s is still a scalar. Now we cannot always solve for s to make (3.18) hold. 
Instead, only certain jumps qi — qr are allowed, namely those for which the 
vectors f{qi) - f{qr) and qi - qr are linearly dependent. 

For a linear system with f{q) = Aq, (3.18) gives 




3. Mathematical Theory of Hyperbolic Systems 27 

Mqi - Qr) = s{qi - qr) , (3.20) 

i.e., qi — qr must be an eigenvector of the matrix A and s is the associated 
eigenvalue. For a linear system, these eigenvalues are the characteristic speeds 
of the system. Thus discontinuities can propagate only along characteristics, 
just as for the scalar advection equation. 



3.2 Linear Hyperbolic Systems 

We now begin to investigate systems of equations. We start with constant 
coefficient linear systems. Here we can solve the equations explicitly by trans- 
forming to characteristic variables. We will also obtain explicit solutions of 
the Riemann problem and introduce a “phase space” interpretation that will 
be very useful in our study of nonlinear systems. 

Consider the linear system 

qt -h Aqx = 0 , (3.21) 

9(x,0) =qQ{x) , 

where q : IR x IR -> IR”* and A € IR'”^'" is a constant matrix. This is a 
system of conservation laws with the flux function f{q) = Aq, This system 
is called hyperbolic if A is diagonalizable with real eigenvalues, so that we 
can decompose 

A = RAR-^ , (3.22) 

where A — diag(Ai, A 2 , • . . , A^^) is a diagonal matrix of eigenvalues and R = 
[ri|r 2 | • ♦ ‘ \rm] is the matrix of right eigenvectors. Note that AR = RA, i.e., 

Arp — XpVp for p = 1, 2, , . . , m . (3.23) 

The system is called strictly hyperbolic if the eigenvalues are distinct. We 
will always make this assumption as well. 

3.2.1 Characteristic Variables. We can solve (3.21) by changing to the 
“characteristic variables” 

V = R-^q . (3.24) 

Multiplying (3.21) by and using (3.22) gives 

vt + Av^ = 0. (3.25) 

Since A is diagonal, this decouples into m independent scalar equations. Each 
of these is a constant coefficient linear advection equation, with solution 

Vp{x, t) =Vp{x- Xpt, 0) . (3,26) 

Since v — R^^q^ the initial data for Vp is simply the pth component of the 
vector 

v{x,0) - R~^qo{x) . 



(3.27) 
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The solution to the original system is finally recovered via (3.24): 

q{x,t) = Rv{x,t) . (3.28) 

Note that the value Vp{x^t) is the coefficient of rp in an eigenvector ex- 
pansion of the vector g(x,<), i.e., (3.28) can be written out as 

m 

q{x,t) — ^^Vp{x,t)rp . (3.29) 

i—\ 

Combining this with the solutions (3.26) of the decoupled scalar equations 
gives 

m 

q{x, 0 ^ Vp{x - Xpt, 0)rp . (3.30) 

p=i 



Note that q{x, t) depends only on the initial data at the m points x — Apt, 
so the domain of dependence is given by D(x,t) = {x = x - Xpt^ p = 
1, 2, m}. 

The curves x = xo-f Apt satisfying x'(t) = Ap are the “characteristics of the 
pth family” , or simply “p- characteristics” . These are straight lines in the case 
of a constant coefficient system. Note that for a strictly hyperbolic system, m 
distinct characteristic curves pass through each point in the x~t plane. The 
coefficient t^p(x,t) of the eigenvector Vp in the eigenvector expansion (3.29) 
of g(x, t) is constant along any p-characteristic. 



3.2.2 The Riemann Problem. For the constant coefficient linear system, 
the Riemann problem can be explicitly solved. We will see shortly that the 
solution to a nonlinear Riemann problem has a simple structure which is 
quite similar to the structure of this linear solution, and so it is worthwhile 
studying the linear case in some detail. 

The Riemann problem consists of the equation qt + Aqx = 0 together with 
piecewise constant initial data of the form 

® : s 2 . (3,31) 



Recall that the general solution to the linear problem is given by (3.30). For 
the Riemann problem we can simplify the notation if we decompose qi and 
q^* a.s 



m 

91 = ’ 

P=1 



m 

9r = ■ 

p=l 



Then 

Wp(x,0) 



Vp X < 0 

Vp X > 0 



(3.32) 



and so 



(3.33) 
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Vp{x,t) 



( Vp if X — 

I Up if a: — 



-Xpt<0 

\pt>0. 



(3.34) 



Wp A*. *A/ 

If we let P{xj t) be the maximum value of p for which x — Xpt > 0, then 

P{x,t) m 

q(x, t)= v^rp + ^ v{,rp . (3.35) 

P=1 p=P{x,t)-\-l 



The determination of q{x^ t) at a given point is illustrated in Fig. 3.3. In the 
cajse shown, vi = v{ while V 2 = V 2 = V 3 . The solution at the point 

illustrated is thus 



q{x,t) = u[ri + v[r 2 + v[rz . (3.36) 

Note that the solution is the same at any point in the wedge between the 
x' = Ai and x' = A 2 characteristics. As we cross the pth characteristic, the 
value of X - Xpt passes through 0 and the corresponding Vp jumps from Vp to 
Vp. The other coefficients Vi {i ^ j) remain constant. 

The solution is constant in each of the wedges as shown in Fig. 3.4. Across 
the pth characteristic the solution jumps with the jump given by 

[ 9 ] = K - '>^‘pK ■ (3.37) 

Note that these jumps satisfy the Rankine-Hugoniot conditions (3.18), 
since f{q) = Aq leads to 

[/] = A[q] = (v^ - v‘p)Avp = Ap[g] 

and Xp is precisely the speed of propagation of this jump. The solution ^(x, t) 
in (3.35) can alternatively be written in terms of these jumps as 

q(x, t) = qi+ (Up - uj,)rp (3.38) 

\p<x/t 

= Qr- Y, (3-39) 

\p>x/t 

It might happen that the initial jump qr - qi is already an eigenvector of A, 

if qp - qi = {v^ -Vi)ri for some i. In this case v^p = for p ^ i. Then this 

discontinuity simply propagates with speed Ai, and the other characteristics 
carry jumps of zero strength. 

In general this is not the case, however, and the jump qr — qi cannot 
propagate as a single discontinuity with any speed without violating the 
Rankine-Hugoniot condition. We can view “solving the Riemann problem” 
as finding a way to split up the jump qr — qi into a sum of jumps, 

qr-qi = (u[ - u'l )ri + ■ ■ ■ + (t;;;^ - vljrm H airi + • • • + , (3.40) 

each of which can propagate at an appropriate speed Ai with the Rankine- 
Hugoniot condition satisfied. 




30 



R.J. LeVeque 




Fig. 3.3. Construction of the solution to the Riemann problem at We trace 

back along the p’th characteristic to determine the coefficient of rp depending on 
whether this lies in the left state or right state initially 



x' = A' x' - A^ = A^ 




Fig. 3.4. Values of solution q in each wedge of x-t plane: 

qi = v\r\ -I- v^2 + q* = vln + vlv2 + vara , 

qi ~ + ^2^2 + vlv 3 , qr = vlri + V2X2 + t^ 3^3 • 

Note that the jump across each discontinuity in the solution is an eigenvector of A 



For nonlinear systems we solve the Riemann problem in much the same 
way: The jump qr - qi will usually not have the property that [/] is a scalar 
multiple of [q], but we can attempt to find a way to split this jump up as a 
sum of jumps, across each of which this property does hold. (Although life is 
complicated by the fact that we may need to introduce rarefaction waves as 
well as shocks.) In studying the solution of the Riemann problem, the jump 
in the pth family, traveling at constant speed Ap, is often called the p-wave. 

3.2.3 The Phase Plane. For systems of two equations, it is illuminating to 
view this splitting in the phase plane. This is simply the qi~q 2 plane, where 
q = {quq2)- Each vector q{x,t) is represented by a point in this plane. In 
particular, qi and qr are points in this plane and a discontinuity with left and 
right states qi and qr can propagate as a single discontinuity only if qr — qi 
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q2 




Fig. 3.5. (a) The Hugoniot locus of the state qi consists of all states that diEer 
from qi by a scaleu: multiple of r\ or r 2 . (b) Solution to the Riemann problem in 
the x-t plane 



is an eigenvector of A, which means that the line segment from qi to qr must 
be parallel to the eigenvector r\ or r 2 . Figure 3,5 shows an example. For the 
state qi illustrated there, the jump from qi to qr can propagate as a single 
discontinuity if and only if qr lies on one of the two lines drawn through qi 
in the drection ri and r 2 * These lines give the locus of all points that can 
be connected to qi by a 1-wave or a 2-wave. This set of states is called the 
Hugoniot locus. We will see that there is a direct generalization of this to 
nonlinear systems in the next chapter. 

Similarly, there is a Hugoniot locus through any point qr that gives the 
set of all points qi that can be connected to qr by an elementary p-wave. 
These curves are again in the directions ri and r 2 * 

For a general Riemann problem with arbitrary qi and qr, the solution 
consists of two discontinuities travelling with speeds Ai and A 2 , with a new 
constant state in between that we will call q*. By the discussion above, 

q* = v{ri + V 2 T 2 , (3.41) 




Fig. 3.6. The new state q* cirising in the solution to the Riemann problem for two 
different choices of qi and In each case the jump from qi to q* lies in the direction 
of the eigenvalue ri corresponding to the slower speed, while the jump from g* to 
qr lies in the direction of the eigenvalue r 2 
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so that q* - qi = airi and qr ~ q* = oc 2 V 2 ^ The location of q* in the phase 
plane must be where the 1-wave locus through qi intersects the 2-wave locus 
through This is illustrated in Fig. 3.6a. 

Note that if we interchange qr and qi in this picture, the location of g* 
changes as illustrated in Fig. 3.6(b). In each case we travel from qi to qr by 
first going in the direction ri and then in the direction r 2 . This is required by 
the fact that Ai < A -2 since clearly the jump between qi and q* must travel 
slower than the jump between q* and qr if we are to obtain a single-valued 
solution. 

For systems with more than two equations, the same interpretation is pos- 
sible but becomes harder to draw since the phase space is now m dimensional. 
Since the m eigenvectors Vp are linearly independent, we can decompose any 
jump qr - qi into the sum of jumps in these directions, obtaining a piecewise 
linear path from qi to qr in m-dimensional space. 

3.3 Nonlinear Systems 

Now consider a nonlinear system of conservation laws 

qt + f{q)x = 0 , (3.42) 

where q : El x IR IR^ and / : IR"^ -)• IR”^. This can be written in the 
quasilinear form 

qt + A{q)qa: = 0 , (3.43) 

where A{q) — f\q) is the m x m Jacobian matrix. Again the system is 
hyperbolic if A{q) is diagonalizable with real eigenvalues for all values of 
q^ at least in some range where the solution is known to lie, and strictly 
hyperbolic if the eigenvalues are distinct for all q. We will assume that the 
system is strictly hyperbolic, and order the eigenvalues of A in increasing 
order, 

Ai < A-2 < ■ • ■ < Ani . (3.44) 

Since the eigenvalues are distinct, the eigenvectors are linearly dependent. 
We choose a particular basis for these eigenvectors, {rp(g)}^i , usually chosen 
to be normalized in some manner, e.g., ||rp(gf)l| = 1. 

In the previous section we constructed the solution to the general Rie- 
mann problem for a linear hyperbolic system of conservation laws. Our next 
goal is to perform a similar construction for the nonlinear Riemann problem. 
In the linear case the solution consists of m waves, which axe simply dis- 
continuities traveling at the characteristic velocities of the linear system. In 
the nonlinear case our experience with the scalar equation leads us to expect 
more possibilities. In particular, the physically relevant vanishing- viscosity 
solution may contain rarefaction waves as well as discontinuities. We will 
first ignore the entropy condition and ask a simpler question: is it possible 
to construct a weak solution of the Riemann problem consisting only of m 
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discontinuities propagating with constant speeds < S 2 < - • < 5m? As we 
will see, the answer is yes for \\qi — qr\\ sufficiently small. 

3,3.1 The Hugoniot Locus. Recall that if a discontinuity propagating 
with speed s has constant values q and w on either side of the discontinuity, 
then the Rankine-Hugoniot jump condition must hold, 

f{w) - f{q) = s{w - q) . (3.45) 

Now suppose we fix the point q G IR^ and attempt to determine the set of 
all points w which can be connected to g by a discontinuity satisfying (3.45) 
for some s. This gives a system of m equations in m 4- 1 unknowns: the m 
components of w, and s. This leads us to expect one-parameter families of 
solutions. 

We know that in the linear case there are indeed m such families for any 
q. In the pth family the jump w - q is some scalar multiple of rp, the pth 
eigenvector of A. We can parameterize these families of solutions using this 
scalar multiple, say and we obtain the following solution curves: 

Wp{^-,q) = q + ^rp, 

Spi^-,q) = K 

for p = 1, 2, . . . , m. Note that Wp{0; q) = q for each p and so through the 
point q in phase space there are m curves (straight lines in fact) of possible 
solutions. This is illustrated in Fig. 3.6 for the case m = 2. The two lines 
through each point are the states that can be connected by a discontinuity 
with jump proportional to r 1 or r 2 . 

In the nonlinear case we also obtain m curves through any point g, one 
for each characteristic family. We again parameterize these curves by Wp{^; q) 
with Wp{0; g) = g and let Sp(^; g) denote the corresponding speed. To simplify 
notation, we will frequently write these as simply lyp(^), Sp(^) when the point 
g is clearly understood. 

The Rankine-Hugoniot condition gives 

fiwpiO) - f(q) = spiOiwpiO - q) • (3.46) 

Differentiating this expression with respect to ^ and setting ^ = 0 gives 
f'iqWpiO) = Sp(0)«;p(0) , (3.47) 

SO that Wp{0) must be a scalar multiple of the eigenvector rp{q) of /'(g), while 
Sp(0) = Ap(g). The curve Wp{^) is thus tangent to rp(g) at the point g. For a 
system of m — 2 equations, this is easily illustrated in the phase plane. An 
example for the isothermal equations of gas dynamics is shown in Fig. 3.7. 
See [141] for details on the construction of these curves. 

For smooth /, it can be shown using the implicit function theorem that 
these solution curves exist locally in a neighborhood of g, and that the func- 
tions Wp and Sp are smooth. See Lax [136] or Smoller[221] for details. These 
curves are called Hugoniot curves. The set of all points on these curves is 




34 



R.J. LeVeque 





Fig. 3.7. (a) States qr that can be connected to qi = q by an entropy-satisfying 
shock in the density-momentum phase plane for the isothermal equations, (b) States 
qi that can be connected to 9 by an entropy-satisfying shock. In each case the 
entropy- violating branches of the Hugoniot locus axe shown as dashed lines 



often collectively called the Hugoniot locus for the point q. If qp lies on the 
pth Hugoniot curve through q^ then we say that q and qp are connected by a 
p-shock. 

3.3.2 All-Shock Solution of the Riemann Problem. Now suppose that 
we wish to solve the Riemann problem with left and right states qi and qr 
(and recall that we are ignoring the entropy condition at this point). Just as 
in the linear case, we can accomplish this by finding an intermediate state q* 
such that qi and q* are connected by a discontinuity satisfying the Rankine- 
Hugoniot condition, and so are q* and qr. Graphically we accomplish this by 
drawing the Hugoniot locus for each of the states qi and qr and looking for 
intersections. See Fig. 3.8 for an example with the isothermal equations. 

More generally, for a system of m equations we can attempt to solve the 
Riemann problem by finding a sequence of states gi, ^ 2 , • • • , Qm-i such that 
qi is connected to by a 1-shock, q\ is connected to q 2 by a 2-shock, and so 
on, with qm-\ connected to qr by an m-shock. If qi and qr are sufficiently close 
together then this can always be achieved. Lax proved a stronger version of 
this in his fundamental paper [135]. (By considering rarefaction waves also. 
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Fig. 3.8. Construction of a 2-shock solution to the Riemann problem for the isother- 
mal equations with left and right states qi and Qr . The intermediate state g* is ob- 
tained between the two shocks. If the states qi and qr were switched in the Riemann 
problem, then the intermediate state g** would be obtained instead 



the entropy satisfying solution can be constructed in a similar manner, as we 
will do in Sect. 3.3.6) 

3.3.3 Genuine Nonlinearity. The pth characteristic field is said to be 
genuinely nonlinear (a natural generalization of the convex scalar equa- 
tion) if 

VAp(g) • rp(g) ^0 for all g , (3.48) 

where VAp(g) = {dXp/dqiy dXpfdqm) is the gradient of Ap(g). Note 
that in the scalar case, m = 1 and Ai(g) = /'(g) while ri(g) = 1 for all 
g. The condition (3.48) reduces to the convexity requirement f”{q) 0 Vg. 

This implies that the characteristic speed /'(g) is monotonically increasing 
or decreasing as g varies, and leads to a relatively simple solution of the 
Riemann problem. 

For a system of equations, (3.48) implies that Ap(g) is monotonically in- 
creasing or decreasing as g varies along an integral curve of the vector field 
Tp(g). An integral curve of rp(g) is a curve with the property that it is ev- 
erywhere tangent to rp(g). This will be discussed in detail shortly, where 
we will see that through a rcirefaction wave g varies along an integral curve. 
Since monotonicity of the propagation speed Ap is clearly required through a 
rarefaction wave, genuine nonlinearity is a natural assumption. 

For the Euler equations the two acoustic waves are genuinely nonlinear. 

3.3.4 The Lax Entropy Condition. In defining the Hugoniot locus above, 
we ignored the question of whether a given discontinuity is physically relevant. 
The state g is in the Hugoniot locus of g if the jump satisfies the Rankine- 
Hugoniot condition, regardless of whether this jump could exist in a vanishing 
viscosity solution. We would now like to define an entropy condition that can 
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be applied directly to a discontinuous weak solution to determine whether the 
jumps should be allowed. Lax [135] proposed a simple entropy condition for 
systems of equations that are genuinely nonlinear. Lax’s entropy condition 
says that a jump in the pth field (from qi to qr, say) is admissible only if 

\p{qi) > s> Xpiqr) , (3.49) 

where s is again the shock speed. Characteristics in the pth family disappear 
into the shock as time advances, just as in the scalar case. Note that for a 
system of m equations we thus expect to have a total of m + 1 characteristics 
impinging on the shock, those from families 1 through p from the left and 
from families p through m from the right. 

3.3.5 Linear Degeneracy. The assumption of genuine nonlinearity is ob- 
viously violated for a constant coefficient linear system, in which Xp{q) is 
constant and hence VAp — 0. More generally, for a nonlinear system it might 
happen that in one of the characteristic fields the eigenvalue Ap(g) is constant 
along integral curves of this field, and hence 

VXp{q)^rp{q) = Q Vg . (3.50) 

(Of course the value of Xp{q) might vary from one integral curve to the next.) 
In this case we say that the pth field is linearly degenerate. 

In the Euler equations, the contact discontinuity is linearly degenerate. 
The characteristic speed is u, the fluid velocity, which is the same on each 
side of an arbitrary contact discontinuity. 

3.3.6 Rarefaction Waves and Integral Curves. We now turn our at- 
tention to rarefaction waves. All of the Riemann solutions considered so far 
have the following property: the solution is constant along all rays of the form 
X = Consequently, the solution is a function ofx/t alone, and is said to be 
a “similarity solution” of the PDE. A rarefaction wave solution to a system 
of equations also has this property and takes the form 

{ qi x<^it 

w{xft) ^\t<x<^ 2 t (3.51) 

Qr X> ^2t , 

where xe is a smooth function with xu(^i) = qi and xc(^ 2 ) = qr- 

When does a system of equations have a solution of this form? As in the 
case of shocks, for arbitrary states qi and Qr there might not be a solution 
of this form. But in general, starting at each point qi there are m curves 
consisting of points qr which can be connected to qi by a rarefaction wave. 
These turn out to be subsets of the integral curves of the vector fields rp{q). 

An integral curve for rp{q) is a curve which hcis the property that the tan- 
gent to the curve at any point q lies in the direction rp{q). The existence of 
smooth curves of this form follows from smoothness of / and strict hyperbol- 
icity, since rp{q) is then a smooth function of q. If gp(^) is a parameterization 
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(for ^ G IR) of an integral curve in the pth family, then the tangent vector is 
proportional to rp{qp{^)) at each point, i.e., 

q'piO = , (3.52) 

where a(^) is some scalar factor. 

A function of q that is constant along any integral curve is called a Rie- 
mann invariant of the system. Such functions can always be found for 
systems of two equations, and the integral curves are level sets of the invari- 
ants. Riemann invariants can be used to solve Riemann problems involving 
rarefaction waves. 

To see that rarefaction curves lie along integral curves, and to explicitly 
determine the function w{x/t) in (3.51), we differentiate q{x,t) = w{x/t) to 
obtain 



X 

qt(x,t) = --^w'{x/t), 


(3.53) 


qx{x,t) = 


(3.54) 


so that qt + f'{q)qx = 0 yields 




-^w'ixlt) + jf'{w{x/t))w'{x/t) = 0 . 


(3.55) 


Multiplying by t and rearranging gives 




f'iwiOWiO = , 


(3.56) 



where ^ = x/t. One possible solution of (3.56) is tc'(^) = 0, i.e., w constant. 
Any constant function is a similarity solution of the conservation law, and 
indeed the rarefaction wave (3.51) takes this form for ^ and ^ > ^ 2 - In 
between, w is presumably smoothly varying and w' ^ 0. Then (3.56) says 
that must be proportional to some eigenvector rp(iy(^)) of 

w'iO = aiOrp{wiO) (3.57) 

and hence the values ty(^) all lie along some integral curve of Vp. In partic- 
ular, the states qi = w{^i) and qr = '^{^ 2 ) both lie on the same integral 
curve. This is a necessary condition for the existence of a rarefaction wave 
connecting qi and qr, but note that it is not sufficient. We need ^ = x/t to 
be monotonically increasing as w{^) moves from qi to qr along the integral 
curve; otherwise the function (3.51) is not single- valued. If this characteristic 
field is genuinely nonlinear, then the eigenvalue will vary monotonically along 
the integral curve. 

Figure 3.9 shows the integral curves for the isothermal equations through 
the same state as shown in Fig. 3.7. See [141] for more details on the con- 
struction of these curves. Which points can be connected to this state by a 
physically realizable rarefaction wave depends on whether the fixed state is 
to be on the left or right, since the speed must increase from left to right. 
The realizable rarefaction waves are indicated by solid lines in the figure. 
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Fig. 3.9. (a) States Qt that can be connected to g/ = ^ by a rarefaction wave for the 
isothermal equations, (b) States qi that can be connected to = g by a rarefaction 
wave. In each case the full integral curves axe shown as dashed lines 



These integral curves look quite similar to the Hugoniot locus shown in 
Fig. 3.7. Locally, near the point g, they must in fact be very similar. We know 
already that in the pth family each of these curves is tangent to rp(g) at g. 
Moreover, it can be shown that the curvature of both curves is the same 
(See Lax [136]). The curves are not identical, however. Figure 3.10 shows a 
comparison of the two sets of curves through one particular state. 

3.3.7 General Solution of the Riemann Problem. We can combine 
the appropriate curves from Fig. 3.7 and Fig. 3.9 to obtain a plot showing all 
states that can be connected to a given g by entropy-satisfying waves, either 
shocks or rarefactions. Again, the nature of this plot depends on whether g is 
to be the left state or right state, so we obtain two plots as shown in Fig. 3.11. 
Here Sp is used to denote the locus of states that can be connected by a p- 
shock and IZp is the locus of states that can be connected by a p-rarefaction. 
Notice that the shock and rarefaction curves match up smoothly (with the 
same slope and curvature) at the point g. 

To solve the general Riemann problem between two different states g/ and 
qr , we simply superimpose the appropriate plots and look for the intersection 
qm of a 1-wave curve from g/ and a 2- wave curve from gr. An example for 
the isothermal equations is shown in Fig. 3.12. This is the same example 
considered in Fig. 3.8. We now see that the entropy-satisfying weak solution 
consists of a 1-shock from qi to qm followed by a 2-rarefaction from qm to g^. 
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Fig, 3.10. Compajison of the states that can be connected to q through a rarefac- 
tion or a shock. The solid lines are the Hugoniot locus while the dashed lines are 
the integral curves 





Fig. 3.11. (a) Set of states that can be connected to qi by an entropy-satisfying 
1-wave or 2- wave, (b) Set of states that can be connected to qr. In each case, 7Zp 
denotes p-rarefactions and Sp denotes p-shocks 
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Fig. 3.12. Construction of a weak solution to the Riemann problem with left and 
right states qi and qr . Solid lines show the Hugoniot locus (shocks) and dashed lines 
show the integral curves (rarefactions) 



3,4 The Riemann Problem for the Euler Equations 



If we compute the Jacobian matrix f\q) from (2.16), with the poly tropic 
equation of state (2.13), we obtain 



/'(?) = 



4(7+ l)w^ 



(3 - 7)i 



\_-u{E+p)Iq+ i (7 - l)u^ {E + p)fg- (7 - l)i 



0 ■ 
(7-1) 

7?i 



The eigenvalues are 

Ai(q)=u-c, A 2 (g)=u, \^{q)~u-\-c, 



(3.58) 



where c is the sound speed, 

(3.59) 

Of particular note in these equations is the fact that the second charac- 
teristic field is linearly degenerate. It is easy to check that 




r2{q) = 



1 

u 



(3.60) 



is an eigenvector of /'(?) with eigenvalue X 2 {q) = u = {eu)/g. Since 



VA2(g) = 



-u/g 

Vq 

0 



(3.61) 



we find that VA 2 • r 2 = 0. 
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Since the second field is linearly degenerate, we can have neither rarefac- 
tion waves nor shocks in this field. Instead we have contact discontinuities, 
which are linear discontinuities that propagate with speed equal to the char- 
acteristic speed A 25 which is simply the fluid velocity u. Across a contact 
discontinuity there is a jump in the density of the gas but the pressure and 
velocity axe continuous. 

The first and third characteristic fields are genuinely nonlinear and have 
behavior similar to the two characteristic fields in the isothermal equations. 
The solution to a Riemann problem typically has a contact discontinuity and 
two nonlinear waves, each of which might be either a shock or a rarefaction 
wave depending on qi and A typical solution was shown in Fig. 2.1. 

Because u and p are constant across the contact discontinuity, it is often 
easier to work in the variables (^, u,p) rather than (g^gUyE), although of 
course the jump conditions must be determined using the conserved variables. 
The resulting Hugoniot locus and integral curves can be transformed into 
(^>, u,p) space. 

If the Riemann data is (pz,u/,p/) and (gr,Ur,Pr), then the two new 
constant states that appear in the Riemann solution will be denoted by 
Qi and q* — {Qr,u*,P*) (see Fig. 2.1.). Note that across the 

2 - wave we know there is a jump only in density. 

Solution of the Riemann problem proceeds in principle just as before. 
Given the states qi and qr in the phase space, we need to determine the two 
intermediate states in such a way that qi and ql are connected by a 1 -wave, 
qi and q* are connected by a 2 - wave, and finally q* and qr are connected by 
a 3- wave. 

This procedure is greatly simplified by using the known structure of the 
solution. The 1 -wave Hugoniot locus and integral curves can be parameterized 
by the pressure. Let qi = {OhUi^pi) be the left state. Then for any choice of 
Pi we can determine values pj' and Ui so that (pz,u/,pz) and are 

connected by an admissible shock wave (if p^ > pz) or by a rarefaction wave 
(if Pt < Pi). This defines in particular a function for the velocity in 
terms of the pressure p^*, which depends of course on the left state qp, 

u! =U^{p:-,qi). 

This function is defined by integrating the Riemann invariant through the 
rarefaction wave or by using the jump conditions across a shock wave. 

Similarly, the 3-wave Hugoniot locus and integral curves can be parame- 
terized by pressure, defining an analogous function so that 

u; = uHp;-,qr) . 

This gives the velocity in the state connected to qr by a 3- wave when p* is 
the pressure in this state. 

Since the 2 - wave is always a contact discontinuity, we know that u and p 
must be constant across this wave. We can thus reduce the solution process 
for the general Riemann problem to finding the single value p* = pj' — p* 
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(the intermediate pressure) so that the resulting velocities u* and u* axe also 
equal: 

U^ip*;qt) = U^ip*-,Qr) . 

This is a single nonlinear equation for p* which can be solved by Newton’s 
method, for example. Then the intermediate velocity u* can be determined by 
this common value of The densities Qi and g* are also determined 

from the parameterization of states based on pf = p*. In general pf ^ p*, 
but since any jump in density is allowed by the Hugoniot condition across 
the contact discontinuity, we have solved the problem. 

Figure 3.13 shows the solution to a typical Riemann problem in the 
velocity-pressure plane, for a shock tube problem where the initial veloc- 
ity is zero everywhere, initial density is 1 everywhere, and there is a jump 
in pressure from 100 to 20. The resulting intermediate state q* is connected 
to qi by a left-going rarefaction wave and connected to qr by a right-going 
shock, similar to what was seen in Fig. 2.1. 

For the adiabatic Euler equations, these curves are often called the adia- 
bats. In Sect. 9.5 we will see that these adiabats change if the gas is moving 
at relativistic speeds. 




Fig. 3 . 13 . Intersection of loci connecting qi to qr through a 1— rarefaction and a 
3— shock for pi = 100, gi = l^ui = 0 and pr = 20, pr = 1, = 0 



Godunov first proposed a numerical method based on the solution of 
Riemann problems and presented one iterative method for computing p* in 
his paper, [98] (also described in §12.15 of [192]). Chorin[52] describes an 
improvement of this method. More details on the solution of the Riemann 
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problem can also be found in §81 of [60]. Solving the Riemann problem for 
non-ideal gases can be more difficult. See [57], for example. 



4. Numerical Methods in One Dimension 



4.1 Finite Diflference Theory 



For students not familiar with numerical methods, it may be useful to review 
some of the fundamental ideas and issues that arise in discretizing differential 
equations, before going into more detail on hyperbolic problems. This section 
contains a very brief overview of these issues in the context of finite difference 
discretization of ODE’s and linear PDE’s, More details in the same style 
can be found in [142]. Many elementary texts are available that cover this 
material in depth, e.g., [115], [169], [192], [232], The basic ideas in the 
context of hyperbolic systems are also discussed in [141]. 

After introducing some concepts in the context of finite difference meth- 
ods, we will then turn to the finite-volume viewpoint that is often more 
valuable in developing methods for conservation laws. 

We will generally consider finite difference grids with mesh spacing h = 
Ax ~ Ay = Az in all spatial directions for simplicity, and for time-dependent 
equations the time step will be denoted hy k — At. Subscript indices will refer 
to spatial grid points while superscript indices refer to the time step, so for 
example 



Q'tj ~ Q{^i,yj,tn) ■ 



With a finite difference method, we discretize the derivatives appearing 
in the differential equation so that we obtain a system of algebraic relations 
between (approximate) values at all the grid points. We can then solve this 
finite set of equations for the discrete grid-point values on the computer. 

As examples of finite difference methods, consider the advection equation 
(2.10). There are many different ways to discretize the derivatives, leading to 
different finite difference methods. As one example, using one-sided approxi- 
mations to both qt and Qx gives 



1 I “ 



or 



-Q? . .. fQ?+i - Q? 

t I u 



= 0 



= 0 . 



(4.1) 

(4.2) 



Which of these is better to use depends on the velocity u. If u > 0 then 
the fluid flows from left to right and we should use (4.1), which updates 
based on the value to the left (in the upstream or upwind direction). If 
u < 0 then (4.2) is better. These methods are called the first-order upwind 
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methods. They can be rewritten to give an explicit formula for in 

terms of data at the previous time step, e.g., (4.1) gives 



Qr' =Q^-lu{Q'l-QU)- 



(4.3) 



These methods are easily extended to systems of equations of the form (1-4), 

e.g., 

= Q^-\a{Q'1-QU)- (4.4) 

This one-sided form would be appropriate only if all eigenvalues of A are 
positive. 

To achieve second-order accuracy we could use centered approximations 
to the derivatives, e.g., 



Qr-Q 'i 

2k 



n — 1 



+ A 



2h 



which, gives the leapfrog method 

Another approach is based on the Taylor series expansion 
q{x, t + k)= q{x, t) + kqt{x, t) + (a:, t) + ■ ■ ■ 

and the observation that from qt = —Aqx we can compute 
Qu ~ ~Aqxt ~ ~Aqtx ~ ~A{—Aqx)x — A qxx 



(4.5) 



(4.6) 



(4.7) 

(4.8) 



so that (4.7) becomes 

q{x,t + k) = q{x,t) - kAqx(x,t) + ^k‘^A‘^qxx{3:,t) H . (4.9) 

The Lax-WendrofF method then results from retaining only the first three 
terms of (4.9) and using centered difference approximations for the derivatives 
appearing there: 

=Q7- - Qi-i) + ^^'(^"+1 - + Q?-i) ■ ( 4 . 10 ) 

For conservation laws arising from physical principles, it is often important 
to insure that the numerical methods also conserve the appropriate quantities, 
such as mass, momentum, and energy. This is particularly true when the 
solution involves shock waves, as discussed in Sect. 4.3. For conservation laws 
it is thus often preferable to use a finite volume method rather than a finite 
difference method. In a finite volume method the discrete value Qf is viewed 
as an approximation to the average value of q over a grid cell rather than 
as an approximation to a point wise value of q. The cell average is simply 
the integral of q over the cell divided by its area, so conservation can be 
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maintained by updating this value based on fluxes through the cell edges. 
This is described in detail starting in Chapter 4,2. Although the derivation of 
such methods may be quite different from that of finite difference methods, 
the resulting formulas may be identical. We will see, for example, that the 
upwind and Lax-Wendroff methods are easily interpreted as finite volume 
methods. 

The main issues that should be understood in order to effectively develop 
and use finite-difference methods are briefly outlined below. 

4.1.1 Convergence and Global Order of Accuracy. As we refine the 
grid we hope to get better approximations to the true solution. The difference 
between the true and computed solution is called the global error and the 
method is convergent if this goes to zero as the mesh is refined. The method 
has (global) order p if the global error is 0{h^ + k^) as h, A: — 0. In analyzing 
a method it is generally impossible to get an expression for the global error 
directly, however, since the system of equations to be solved grows without 
bound in size as we refine the grid. 

4.1.2 Consistency and the Local Truncation Error. It is much easier 
to compute the local truncation error (LTE), which is a measure of how 
well the difference equations model the differential equation locally. This is 
defined by inserting the true solution into the difference equations at a single 
arbitrary point in spacetime. We don’t generally know the true solution, but 
if it is a smooth function then we can expand in Taylor series and use the 
differential equation to determine an expansion for the LTE, which shows 
the local order of the method. If the LTE vanishes like 0{h^ H- k^) then the 
method is (locally) p’th order acciu*ate. If the method is at all reasonable 
then we expect the LTE to at least go to zero as the mesh is refined, in which 
case the method is called consistent. 

4.1.3 Stability. We hope that a method which is locally p’th order accurate 
will also give a global error that is p’th order, and in particular will be 
convergent. But this is far from obvious and may not hold. The problem 
is that as we refine the grid we must compute an increasing number of values 
to solve the discrete system, and unless the discretization is stable in some 
way then this process may not converge even if the method is consistent. For 
convergence we generally need both consistency and stability, and stability 
is by far the harder concept to deal with. 

For partial differential equations there is typically some restriction on 
the relation between the time step k and the spatial grid size h. Both the 
Lax-Wendroff method (4.10) and the leapfrog method (4.5), for example, 
are stable only if \Xk/h\ < 1 for all eigenvalues A of the matrix A. This is 
motivated by the discussion of the CFL condition in Sect. 4.1.10 below. The 
upwind method (4.4) is stable only if 0 < AA://i < 1 and in particular can only 
be used if A > 0 for all eigenvalues of A (which also follows from the CFL 
condition). By contrast the method with one-sided differences in the other 
direction is stable only if — 1 < Xk/h < 0 for all A. Godunovas method gives 
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the proper generalization of upwind methods to problems where information 
flows in both directions. See [142] or other references for a more complete 
discussion of stability. 

4.1.4 One-Step and Multistep Methods. For time-dependent problems 
we typically use time-marching procedures, in which the solution at all points 
X E at one time in are used to compute the solution at the next time 

:= tn + k. Such a methods is called a one-step or two-level method. One 
can also use multistep methods that base the solution at time on several 
past levels, but in more than one space dimension the storage requirements 
of multistep methods are often prohibitive. For nonlinear conservation laws 
the standard high-resolution methods are all one-step methods and we will 
concentrate on these. 

4.1.5 Implicit and Explicit Methods. An explicit method is one which 

yields an explicit expression for each value at time tn+i in terms of 

nearby values at time tn^ All of the methods introduced above for hyperbolic 
problems are explicit. 

An implicit method couples together values at different grid points at 
time tn-\-i and hence an algebraic system of equations must be solved in each 
time step in order to advance the solution. For example, the one-dimensional 
heat equation qt = Qxx might be solved by the explicit method 

=Q? + ^iQti - 2Qr + Q?+i) (4-11) 

or by the implicit method 

(4.12) 

which is known the Crank— Nicolson method. To determine the values 
we must use these equations, together with the boundary conditions 
at each end of the interval, to set up and solve a tridiagonal linear system 
of equations in each time step. In one space dimension such a system can 
be solved very efficiently by Gaussian elimination and this is not much more 
work than an explicit method. In more than one dimension one obtains a 
large sparse linear system that has structure similar to what arises from 
discretization of an elliptic equation, although since the solution changes by 
only 0{k) in each time step this can be more easily solved than an elliptic 
equation. 

Explicit methods are generally preferable if they can be used with the 
time step that seems reasonable for the problem, which depends on how 
fine the spatial grid is and how smooth the solution is. For many problems, 
however, an explicit method turns out to be unstable unless the time step is 
orders of magnitude smaller than what seems reasonable based on accuracy 
considerations. This is the case with the heat equation, where the method 
(4.11) turns out to be stable only if A: < whereas (4.12) is stable for any 
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time step. If the solution is varying smoothly then we expect variations in 
time to be on roughly the same scale as variations in space and so we would 
typically like to take a time step k ^ h. This is possible with the implicit 
Crank-Nicolson method whereas the explicit method (4.11) would require 
taking k smaller than this by a factor of h. 

For a nonlinear differential equation an implicit method will give rise to 
a nonlinear algebraic system to solve in each time step which may be much 
more expensive than an explicit method with the same time step. However, 
in situations where an explicit method would force us to take much smaller 
time steps than desired, the use of an implicit method may be much more 
efficient. Such problems are called stiffs as discussed next. 

4.1.6 Stiffness. For time-dependent problems the idea of stiffness is fun- 
damental as it often affects what sort of method must be used. Roughly 
speaking, a problem is said to be stiff if there are a variety of different time 
scales in the solution and we are attempting to model some phenomenon on 
a relatively slow time scale, in a situation where the faster processes simply 
maintain local equilibrium but all visible variations are on the slower scale. 
We would then like to take a time step that is reasonable relative to the active 
time scales, and not have to use the much smaller time step that would be 
necessary if we were modeling activity on the faster time scales. Since these 
time scales may easily vary by a factor of a million or more for practical 
problems, this is a major consideration in the efficiency of the method. The 
problem is that a random perturbation of the solution may activate these fast 
processes, and numerical methods are constantly introducing perturbations 
in the solution. Unless we use a method designed for stiff problems, we will 
have to take small time steps appropriate for the fastest time scales in order 
to maintain stability. Methods for stiff problems are necessarily implicit in 
time. 

4.1.7 Stiff ode’s. The classic example is the case of a stiff ordinary dif- 
ferential equation. See, for example, [131], [142], or any text on numerical 
methods for ordinary differential equations. 

As an example, consider the ODE 

u\t) = X(cost - u) - sin t . (4-13) 

One particular solution is the function u{t) = cost, and this is the solution 
with the particular choice of initial data u(0) = 1. This smooth function is a 
solution for any value of A. If we consider initial data of the form u{to) = r/ 
that does not lie on this curve, then the solution through this point is a 
different function, of course. It is easy to verify that the solution is 

u{t) = — cos(io)) + cosi . (4.14) 

However, if A < 0 (or Re(A) < 0 more generally), this function approaches 
cost exponentially quickly, with decay rate A. Figure 4.1 shows a number of 
different solution curves for this equation with different choices of to and t], 
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Fig. 4.1. Solution curves for the ODE (4.13) for various initial values. Left: With 
A = — 1. Right: With A — —10 and the same set of initial values 




with the fairly modest value A = —1. Figure 4.1b shows the corresponding 
solution curves when A = -10, 

Stiffness causes numerical difficulties because any finite difference method 
is constantly introducing errors. The local truncation error acts as a pertur- 
bation to the system that moves us away from the smooth solution we are 
trying to compute. Why does this cause more difficulty in a stiff system than 
in other systems? At first glance it seems that the stiffness should work to 
our advantage. If we are trying to compute the solution u{t) = cost to the 
ODE (4.13) with initial data w(0) = 1, for example, then the fact that any 
errors introduced decay exponentially should help us. The true solution is 
very robust and the solution is almost completely insensitive to errors made 
in the past. In fact this stability of the true solution does help us, as long as 
the numerical method is also stable. 

The difficulty arises from the fact that many numerical methods, including 
all explicit methods, are unstable (in the sense of absolute stability) unless 
the time step is small relative to the time scale of the rapid transient^ which 
in a stiff problem is much smaller than the time scale of the smooth solution 
we are trying to compute. Although the true solution is smooth and it seems 
that a reasonably large time step would be appropriate, the numerical method 
must always deal with the rapid transients introduced in every time step and 
may need a very small time step to do so stably. 

For systems of ODE’s, stiffness is sometimes defined in terms of the “stiff- 
ness ratio” of the system, which is the ratio 

max |Ap| 
min|Ap| 

over all eigenvalues of the Jacobian matrix f'{u). If this is large then there is 
a large range of time scales present in the problem, a necessary component 
for stiffness to arise. While this is often a useful quantity, one should not 
rely entirely on this measure to determine whether a problem is stiff. For one 
thing, it is possible even for a scalar problem to be stiff (as we have seen in the 
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example above), even though for a scalar problem the stiffness ratio is always 
1 since there is only one eigenvalue. There can still be more than one time 
scale present. In (4.13) the fast time scale is determined by A, the eigenvalue, 
and the slow time scale is determined by the inhomogeneous term sin{t). For 
systems of equations there may also be additional time scales arising from 
inhomogeneous forcing terms or other time-dependent coefficients that are 
distinct from the scales imposed by the eigenvalues. 

4.1.8 Parabolic Equations. Stiffness can be seen in partial differential 
equations as well, and in fact the need for an implicit method such as Crank- 
Nicolson for the heat equation can be interpreted in terms of stiffness. By 
Fourier analysis we can analyze the effect of the linear heat equation on 
arbitrary data by decomposing it into Fourier modes. Sinusoidal initial data 
of the form sin(^x) will decay exponentially at the rate e“^ ^ leading to a 
very rapid damping of high frequencies. In solving the heat equation with 
smooth data (only low frequencies present) we have relatively slow variation 
in time leading us to desire a time step A: « /i. But because any high frequency 
perturbation of the solution leads to rapid transient behavior, the equations 
are stiff and an implicit method must be used. 

Note that the time scale for the heat equation can be arbitrarily fast by 
taking a sufficiently large value of ^ in the perturbation. This might lead 
one to expect the discrete equations would be “infinitely stiflF”, so that an 
explicit method would be unstable for any positive time step. However, when 
we discretize in space on a particular grid we limit the frequencies that can 
be represented so that ^ cannot be arbitrarily large. The largest frequency 
visible on the grid has ^ ^ 1/h and hence decay rate l//i^ which explains the 
k limit on the time step for an explicit method. If we take a time step 
that is too large with an explicit method, then an instability will typically 
appear in the form of an exponentially growing high-frequency oscillation, 
typically at the highest frequency that can be represented on the grid since 
this gives the most rapid growth with an unstable method. 

Parabolic terms arise from dissipative effects such as fluid viscosity, heat 
conduction, or finite resistivity in MHD, and if such terms are included then 
implicit methods must typically be used to obtain efficient solutions. This is 
one reason that we prefer to solve hyperbolic systems of conservation laws in 
which these dissipative terms are ignored whenever possible. 

4.1.9 Hyperbolic Equations. Hyperbolic systems are usually not stiff. 
The time step k for an explicit method is typically limited by something pro- 
portional to Smax^, where Smax is the maximum wave speed in the problem. 
(See Sect. 4.1.10 below.) Provided some disturbance of interest is propagating 
at this maximum speed, then this is generally the right size time step to use 
in order that the temporal resolution is consistent with the spatial resolution. 
In this case, there is no need to use an implicit method. 

In some problems, however, the fastest waves may be physically unimpor- 
tant. For example in atmospheric flow acoustic waves are typically unimpor- 
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tant, and since these travel much faster than the phenomena of interest the 
use of compressible gas dynamics equations would lead to stiffness. Instead 
other systems are typically used that do not model acoustic waves. 

In “incompressible” flow, as is often assumed of liquids, for example, the 
acoustic waves are filtered out by the introduction of an elliptic problem that 
is solved for the pressure, as described in Sect. 8.6.1. This equation couples 
all grid points together, so that solving the elliptic equation is tantamount to 
using an implicit method to model the net effect of the fast acoustic waves 
that have been eliminated from the system. This is relatively efficient since the 
elliptic equation is linear, and the remaining nonlinear hyperbolic equations 
for the advection of momentum can then be solved with an explicit method. 

Nonlinear hyperbolic equations may also exhibit stiffness if the wave 
speeds differ by orders of magnitude at different points in the domain, as can 
happen with a nonlinear problem since the wave speeds depend on q{x^t). 
With an explicit method the time step may be limited by the fastest wave 
speed seen anywhere in the domain, while the phenomenon of interest may 
be in a region with much slower wave speeds. This often happens in MHD 
calculations. 

Stiffness also arises when gas dynamics is coupled together with source 
terms that model radiative transfer or chemical or nuclear reactions that 
may have much faster time scales than the fluid dynamics. This is discussed 
in Sect. 5.4. 

4.1.10 The CFL Condition. The CFL condition is named after Courant, 
Friedrichs, and Lewy, who wrote a fundamental paper in 1928 that was es- 
sentially the first paper on the stability and convergence of finite difference 
methods for partial differential equations [61]. 

To understand this general condition, we must discuss the domain of de- 
pendence of a time-dependent PDE. For the advection equation (1.5), the 
solution q{X,T) at some fixed point (X^T) depends on the initial data 
q^{x) = q{x,0) at only a single point: q{X,T) — q{X - uT). We say that 
the domain of dependence of the point {X,T) is the point X - uT: 

V{X,T) = {X~uT} . 

If we modify the data q^ at this point then the solution q{X^T) will change, 
while modifying the data at any other point will have no effect on the solution 
at this point. 

This is a rather unusual situation for a PDE. More generally we might 
expect the solution at (X, T) to depend on the data at several points or over a 
whole interval. For the linear hyperbolic system qt~\~Aqx = 0, the propagation 
speeds are given by the eigenvalues of A, Ai , A 2 , . . . , Xm- If these values are 
distinct then we will see that the solution g(X,T) depends on the data at 
the m distinct points X — X\T, . . ., X — A^T and hence 

V(X, T) = {X - ApT, for p = 1, 2, . . . , m} . 
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The heat equation qt = q^x has a much larger domain of dependence. For 
this equation the solution at any point (X, T) depends on the data everywhere 
and the domain of dependence is the whole real line, 

V{X,T) = (- 00 , 00 ) . 

This equation is said to have infinite propagation speed, since data at any 
point is felt everywhere at any small time in the future (though its effect 
decays exponentially away from this point). 

A finite difference method also has a domain of dependence. On a partic- 
ular fixed grid we define the domain of dependence of a grid point (xj,fn) 
to be the set of grid points X{ at the initial time t = 0 with the property 
that the data at Xi has an effect on the solution For example, with 
the Lax-Wendroff method (4.10) or any other 3-point method, the value 
depends on These values depend in turn on Q^l 2 

through Qj +2 ‘ Trsicing back to the initial time we obtain a triangular array 
of grid points as seen in Fig. 4.2(a), and we see that depends on the initial 
data at the points Xj — ^ xj^fi. 




Fig. 4.2. (a) Numerical domain of dependence of a grid point when using a 3-point 
explicit method, (b) On a finer grid 



Now consider what happens if we refine the grid, keeping k/h fixed. Fig- 
ure 4.2(b) shows the situation when k and h are reduced by a factor of 2, 
focusing on the same value of (A, T) which now corresponds to on the 
finer grid. This value depends on twice as many values of the initial data, but 
these values all lie within the same interval and are merely twice as dense. 

If the grid is refined further with kfh~r fixed, then clearly the numerical 
domain of dependence of the point (A, T) will fill in the interval [X-T/r, X+ 
T/r]. As we refine the grid, we hope that our computed solution at (X,T) 
will converge to the true solution q{X,T) = q^{X - uT). Clearly this can 
only be possible if 

X -T/r <X -uT <X -\-T/r . (4.15) 

Otherwise, the true solution will depend only on a value q^{X - uT) that 
is never seen by the numerical method, no matter how fine a grid we take. 
We could change the data at this point and hence change the true solution 
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without having any effect on the numerical solution, so the method cannot 
be convergent for general initial data. 

Note that the condition (4.15) translates into |u| < 1/r and hence 
\uk/h\ < 1. For a hyperbolic system with eigenvalues A, we need |AA://i| < 1 
as a necessary condition for stability. 

The CFL condition generalizes this idea: 

The CFL Condition: A numerical method can be convergent only if its 
numerical domain of dependence contains the true domain of dependence of 
the PDEj at least in the limit as k and h go to zero. 

It is important to note that in general the CFL condition is only a neces- 
sary condition. If it is violated then the method cannot be convergent. If it is 
satisfied, then the method might be convergent, but a proper consistency and 
stability analysis is required to prove this and determine the proper stability 
restriction on k and h. 

For the heat equation the true domain of dependence is the whole real 
line. It appears that any 3-point explicit method violates the CFL condition, 
and indeed it does if we fix fc//i as the grid is refined. However, it can be 
shown that the 3-point explicit method (4.11) is convergent as we refine the 
grid provided we have k/h"^ < 1/2. In this case when we make the grid finer 
by a factor of 2 in space it will become finer by a factor of 4 in time, and 
hence the numerical domain of dependence will cover a wider interval at time 
i = 0. As k 0 the numerical domain of dependence will spread to cover 
the entire real line, and hence the CFL condition is satisfied in this case. 

An implicit method such as the Crank-Nicolson method (4.12) satisfies 
the CFL condition for any time step k. In this case the numerical domain 
of dependence is the entire real line because the tridiagonal linear system 
couples together all points in such a manner that the solution at each point 
depends on the data at all points (i.e., the inverse of a tridiagonal matrix is 
dense). 

4.2 Finite Volume Methods 

In the previous sections we reviewed the basic theory of finite difference 
methods for linear hyperbolic problems. In this section, these methods will 
be reinterpreted as finite volume methods, a viewpoint that is important in 
properly extending these methods to nonlinear conservation laws. 

Many high-resolution methods for shock capturing are based on solving 
Riemann problems between states in neighboring grid cells. In this chapter 
we will develop one particular set of methods of this type. The development 
of such methods has a long and rich history, and numerous related methods 
can be found in the literature. Books such as [2], [86], [97], [111], [130], 
[141], [178], [185], [226], [242] contain descriptions of these methods and 
pointers to the literature. 




4. Numerical Methods 



53 



Rather than viewing as an approximation to the single value ^n), 
we will now view it as approximating the average value of q over an interval 
of length h = Ax = {b — a)/N, We will split the physical domain [a, 6] into 
N intervals denoted by 



Ci — ) 



where now Xi = a -h (i — l)/i. The value will approximate the average 
value over the t’th interval at time tn- 




(4.16) 



Notationally it might be better to denote the endpoints of the i’th interval 
by Xj_i /2 and Xi_|_i/ 2 , which would be more symmetric and remind us that 
Qf is an approximation to the average value between these points. However, 
the formulas are less cluttered if we stick to integer subscripts. 

If q{x^t) is a smooth function, then the integral in (4.16) agrees with the 
value of q at the midpoint of the interval to O(h^). By working with cell 
averages, however, it is easier to use important properties of the conservation 
law in deriving numerical methods. In particular, we can insure that the 
numerical method is conservative in a way that mimics the true solution, 
and this is extremely important in accurately calculating shock waves. This 
is because h Q? approximates the integral of q over the entire interval 
[a, 6], and if we use a method that is in conservation form (as described 
below), then this discrete sum will change only due to fluxes at the boundaries 
X = a and x = b. The total mass within the computational domain will be 
preserved, or at least will vary correctly provided the boundary conditions 
are properly imposed. 

The integral form of the conservation law (2.18), when applied to one grid 
cell over a single time step, gives 



r r r^n+i 

/ q{x,t„+i)dx - / q{x,tn)dx = / f{q{xi,t))dt 

JCi JCi Jt„ 

Rearranging this and dividing by h gives 
i J q{x,tn+i)dx = ^ J q{x,t„)dx 



(4.17) 



f /(«(a:i, ^)) ^ f{q{xi+i,t))dt 



This tells us exactly how the cell average of q from (4.16) should be updated 
in one time step. In general, however, we cannot evaluate the time integrals 
on the right-hand side of (4.17) exactly since q{xi^t) varies with time along 
each edge of the cell, and we don’t have the exact solution to work with. 
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But this does suggest that we should develop numerical methods in the flux- 
differencing form 




where is some approximation to the average flux along x ~ Xi\ 



1 



If we can approximate this average flux based on the values then we will 
have a fully-discrete method. 

Since information propagates with finite speed, it is reasonable to first 
suppose that we can obtain based only on the values 
cell averages on either side of this interface. Then we might use a formula of 
the form 



Fr = F{QU,Q7), 

where F is some numerical flux function. The method (4.18) then becomes 

= Q? - ’^{F{Q7,Q7+,) - F{Qtr,Q7)) . (4.20) 

The specific method obtained depends on how we choose the formula F, 
but in general any method of this type is an explicit method with a 3-point 
stencil. Moreover, it is said to be in conservation form, since it mimics the 
property (4.17) of the exact solution. Note that if we sum from (4.18) 

over any set of cells we obtain 

^ ^ k 

hY^Qr}+^ = hY,Q7-T - F?) . (4.21) 

i=/ i~l 

The sum of the flux differences cancels out except for the fluxes at the extreme 
edges. Over the full domain we have exact conservation except for fluxes at the 
boundaries. (Boundary conditions are discussed at the end of this chapter.) 

Note that (4.20) can be viewed as a direct finite difference approximation 
to the conservation law qt -f f{q)x ~ 0, since rearranging it gives 

, F{Q'i^,,Q-)-F{Q’y,QU) n 

k h 

Many methods can be equally well viewed as finite difference approxima- 
tions to this equation or as finite volume methods. In obtaining a method 
in conservation form, the above discussion suggests that we should always 
discretize the conservation law in this form, rather than in the quasi-linear 
form qt + f'{q)qx — 0, for example. 
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4.3 Importance of Conservation Form — Incorrect Shock Speeds 



Using methods in conservation form is particularly important when solving 
problems with shocks or other discontinuities in the solution, as a noncon- 
servative method may give a numerical solution that looks reasonable but is 
entirely wrong. For example, Burgers’ equation 



Qt + 




(4.22) 



can be discretized by the upwind conservative method 



Figure 4.3(a) shows the resulting numerical solution for a shock wave resulting 
between states 1.2 and 0.4 propagating at speed 0.8. The numerical solution 
is slightly smeared about the correct location. 
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Fig. 4.3. (a) True and computed solutions to Burgers’ equation using a conservative 
upwind method, (b) True eind computed solutions to Burgers’ equation using a 
nonconservative upwind method 



On the other hand if we discretize the quasilinear form of Burgers’ equa- 
tion 

qt + qqx=0 (4.23) 

using the nonconservative upwind method 

= Q- - - Q?-i) 

we obtain the results seen in Fig. 4.3(b), The shock is moving at the wrong 
speed! This happens because the equations (4.22) and (4.23) are equivalent 
for smooth solutions but not for problems with shock waves. This example 
is discussed in more detail in [141]. 
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4.4 Numerical Flux Functions 

Given that we want to use a method in conservation form, how should we 
define F{qi,qr)j the average flux at a point based on data qi and qr to the 
left and right of this point? A first attempt might be the simple average 

Fiqi,qr) = \{fiqi)+f{qr)). 

Using this in (4.20) would give 

In general, however, this method turns out to be unconditionally unstable for 
any value of k/h. 

If we instead use the modified flux 

F(qi,qr) = ^ifiqi) + f(qr)) ~ - qi) , (4.24) 

then we obtain the Lax-Friedrichs method, 

+ QiVi) - - fiQti)) ■ (4.25) 

Note that the additional term we have added in (4.24) is a diffusive flux 
based on an approximation to ^qx^ ^nd hence this modification amounts to 
adding some artificial viscosity to the centered flux formula. 

4.5 Godunov’s Method 

Many of the methods we will explore in detail are based on solving the Rie- 
mann problem between the states qi and qr in order to define the numerical 
flux F{qi,qr). To see how this comes about, it is useful to view the data 
at time t^ as defining a piecewise constant function which has the 

value for all x in the interval C{. Suppose we could solve the conservation 
law exactly over the time interval [in, in+i] with initial data g’^(x,i„). Call 
the resulting function q^{x,t) for tn < t < in+i- Then we might consider 
defining the numerical flux Fp in (4.18) by 

= (4.26) 

This integral is trivial to compute compared to the integral (4.19), at least 
provided the time step k is small enough, because of the fact that with piece- 
wise constant initial data we can find the exact solution easily by simply 
piecing together the solutions to each Riemann problem defined by the jump 
at each interface. Figure 4.4 illustrates this for the case of a linear hyperbolic 
system of 2 equations. 
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Fig. 4.4. Solving the Riemann problems at each interface for Godunovas method, 
(a) With Courant number less than 1/2 there is no interaction of waves, (b) With 
Courant number less than 1 the interacting waves do not reach the cell interfaces, 
so the fluxes are still constant in time 



The crucial fact now is that the solution to the Riemann problem at X{ is 
a similarity solution, which is constant along each ray {x — Xi)/t =constant. 
In general, let q*(qi^qr) denote the exact solution to the Riemann problem 
along the ray x/t = 0, obtained when we use data 

q(x, 0 ) = I 
Then we have 



qi if X < 0 

qr if X > 0 , 



r{xi,t)=q-{Qtr,Q7) 



(4.27) 



for all t € [tn,tn-\-i], provided that the time step is small enough that waves 
from the Riemann problems do not travel farther than distance h in this time 
step. If this condition is violated, then the value along x = x^ may change 
after waves from neighboring Riemann problems pass this point. For a linear 
system, the maximum wave speed is maxp |A^|, where are the eigenvalues 
of A, so this condition requires that 



A:max|A^| < h . 

p 



We recognize this as being simply the CFL condition for a 3-point method, 
a condition which we know must be satisfied anyway for stability. 

The method obtained by the procedure outlined above is known as Go- 
dunov’s method, and was introduced in [98] as an approach to solving the 
Euler equations of gas dynamics in the presence of shock waves. 

For the simplest case of scalar advection, solving the Riemann problem 
between states qi and qr gives 



q*{qh 




qi 

qr 



if n > 0 
if w < 0 . 



If w = 0 then q* is not well defined, as the discontinuity is stationary along 
the ray xjt — 0. This is no cause for concern, however, since all we really 
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require is the flux value f{q*) = uq*, and if u — 0 then f{q*) = 0 regardless 
of how we define q*. So we obtain the numerical flux 



1-./ \ */ X i '^qi if u < 0 

F{m,qr)=uq {qi,qr) = ^ 

This can also be written in the compact form 

F{qi,qr) = u'^qi + u~qr (4.28) 



using the notation 

= max{u,0) , u~ = min(t(,0) . (4.29) 

Using this in the conservative method (4.20) gives the upwind method. 
Note that solving the Riemann problem at the interface gives a flux that is 
defined by the value of on the upwind side of the interface, so that the 
method reduces to one-sided differencing in the proper direction. The method 
takes the form 



r)n+i = l 

' IQ” - f «(Q"+i - Qi) if « < 0 



(4.30) 



as introduced in (4.1) and (4.2). 

This method is easily generalized to nonlinear systems if we can solve the 
nonlinear Riemann problem at each cell interface, and this method gives the 
natural generalization of the first-order upwind method to general systems 
of conservation laws. 

Recall that Qf represents an approximation to the cell average of q{x,tn) 
over cell Ci, 




and the idea is to use the piecewise constant function defined by these cell 
values as initial data q'^{x,tn) for the conservation law. Solving over time k 
with this data gives a function q^{x,tn-\-i) that is then averaged over each 
cell to obtain 

q^{x,tn+i)dx , (4.31) 



If the time step k is sufficiently small, then the exact solution g^(x, t) can be 
determined by piecing together the solutions to the Riemann problem arising 
from each cell interface, as indicated in Fig. 4.4(a). 

Recall from Sect. 4.5 that we do not need to perform the integration in 
(4.31) explicitly, which might be difficult since g”(x,in+i) niay be very com- 
plicated as a function of re. Instead, we can use the fact that q^{xi,t) is 
constant in time along each cell interface so that the integral (4.26) can be 
computed exactly. Hence the cell average is updated by (4.20) with 



F{qi,qr) = f{q*{qi,qr)) , 



(4.32) 
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where q*{qhqr) is the solution to the Riemann problem between qi and qr, 
evaluated along x/t = 0. 

In Fig. 4.4(a) the time step was taken to be small enough that there was 
no interaction of waves from neighboring Riemann problems. This would be 
necessary if we wanted to construct the solution at q^{x,tn-\-i) in order to 
explicitly calculate the cell averages (4.31). However, in order to use the flux 
formula (4.32) it is only necessary that q^{xi,t) remain constant in time 
over the entire time step, which allows a time step roughly twice as large, 
as indicated in Fig. 4.4(b). If Smax represents the largest wave speed that is 
encountered then on a uniform grid with the cell interfaces distance h apart, 
we must require 

< 1 (4.33) 

h 

in order to insure that the formula (4.32) is valid. Note that this is pre- 
cisely the CFL condition required for stability of this 3-point method, as 
discussed in Sect, 4.1.10. In general SmaxA://i is called the Courant num- 
ber, Figure 4.4(a) shows a case where the Courant number is less than 1/2 
while Fig. 4.4(b) shows the Courant number close to 1. Note that for a linear 
system of equations, Smax = maxplA^I and this agrees with our previous 
definition of the Courant number. 

4.5.1 Godunov’s Method on Scalar Equations. On a convex scalar 
equation with f\q) an increasing function of the solution to the Riemann 
problem between qi and qr is either a shock traveling at speed s = [f]l[q\ 
(if qi > qr) or a rarefaction wave (if qi < qr) bounded by x/t = f\qi) on 
the left and x/t = f'{qr) on the right. Five possible configurations in the x-t 
plane are shown in Fig. 4,5, In most cases the solution q* along x/t ~ 0 will 
be either qr (if the solution is a shock or rarefaction wave moving entirely 
to the left. Fig. 4.5(a) or (b)), or qi (if the solution is a shock or rarefaction 
wave moving entirely to the right. Fig. 4.5(d) or (e)). 




Fig. 4.5. Five possible configurations for the solution to a scalar conservation law, 
in the x-t plane, (a) Left-going shock, q* = (b) Left-going rarefaction, q* = qr^ 

(c) Transonic rarefaction, q* = go- (d) Right-going rarefaction, q* = qi. (e) Right- 
going shock, q* = qi 



The only case where q* has a different value than qi or qr is if g/ < go < 
where go is the value for which /'(go) = 0- This is called the stagnation 
point since the value go propagates with speed 0, It is also called the sonic 
point since in gas dynamics the eigenvalue u± c takes the value 0 only 
when the fluid speed is equal to the sound speed. The solution to the Rie- 
mann problem in this case, shown in Fig. 4.5(c), consists of a rarefaction wave 
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that is partly left-going and partly right-going. This is called a transonic 
rarefaction since in gas dynamics the fluid is accelerated from a subsonic 
velocity to a supersonic velocity through such a rarefaction. In a transonic 
rarefaction the value along x/t = 0 is simply qq. 

We thus see that the Godunov flux function for a convex scalar conserva- 
tion law is 

( fiqi) if qi > qo and s > 0 

f{qr) if qr < 9 o and s < 0 (4.34) 

f{qo) if qi <qo <qr ■ 

4.6 Approximate Riemann Solvers 

To apply Godunov’s method on a system of equations we need only determine 
Q*{QhQr)i the state along xjt = 0 based on the Riemann data qi and 
We do not need the entire structure of the Riemann problem. However, to 
compute q* we must typically determine the full wave structure and wave 
speeds in order to determine where q* lies in state space. Typically it is one 
of the intermediate states in the Riemann solution obtained in the process of 
connecting q/ to qr by a sequence of shocks or rarefactions, and is one of the 
intersections of Hugoniot loci and/or integral curves. In the special case of a 
transonic rarefaction the value q* will lie along the integral curve somewhere 
between these intersections, and additional work will be required to find the 
value q*. 

A wide variety of approximate Riemann solvers have been proposed 
that can be applied much more cheaply than the exact Riemann solver and yet 
give results that in many cases are equally good when used in the Godunov or 
high-resolution methods. In this section we will look at several possibilities. 

4.6.1 The All-Shock Solver. In solving a nonlinear Riemann problem we 
must worry about whether the wave in each family should be a shock or 
rarefaction so that we know whether to use the Hugoniot locus or integral 
curve correspondingly. One simplification that can be made to the Riemann 
solver is to ignore the possibility of rarefaction waves and simply find a Rie- 
mann solution in which each pair of states is connected along the Hugoniot 
locus. The solution then consists entirely of discontinuities that satisfies the 
Rankine-Hugoniot conditions and is a weak solution of the conservation law. 
This approach is discussed by Colella[54] for gas dynamics. The all-shock 
solver is particularly valuable for problems where the Riemann problem is 
harder to solve, such as in problems where a more complicated equation of 
state than a gamma-law gas must be used. This occurs at high temperatures, 
or in relativistic flow, for example. 

A potential problem with this approach, of course, is that by using a 
solution to the Riemann problem that does not satisfy the entropy condi- 
tion, we might obtain a numerical solution which does not approximate the 
correct weak solution. Actually, however, in most cases (except for transonic 
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rarefactions), Godunov’s method with the Riemann solver will typically work 
well even if the correct solution involves rarefaction waves. This is because 
of the numerical dissipation that is introduced in every step of Godunov’s 
method through the averaging process. Although the discontinuous solution 
used in the solution of a particular Riemann problem may not be correct, by 
averaging this solution over the grid cell at the end of the time step the discon- 
tinuity is smeared out and after many time steps a good approximation to the 
correct rarefaction wave will be computed. The all-shock approximation will 
typically be inadequate in the case of a transonic rarefaction. Such a disconti- 
nuity tends to persist, and results in the computation of an entropy- violating 
weak solution. An “entropy fix” is often used to eliminate this problem, as 
mentioned in Sect. 4.6.4 

4.6.2 The Osher Solver. Another approach would be to take the opposite 
viewpoint and use only the integral curves of the eigenvectors to solve the 
Riemann problem, completely ignoring the Hugoniot locus and the possibility 
of shocks. This has the advantage over the previous approach that rarefaction 
waves, including transonic rarefactions, will then be incorporated. 

There is, however, an obvious difficulty with this approach. The smooth 
solution obtained by integrating along the integral curves is only physically 
realizable if the characteristic speed is increasing as we go from the left state 
to the right state along this curve. Otherwise the discontinuous data evolves 
into an overturned, triply valued “compression wave” instead of a rarefaction 
wave. However, this overturned solution can still be used to define a flux if 
it is interpreted properly. This is the basis for the Osher solver, which is a 
generalization of the Engquist— Osher method for scalar conservation laws, 
first developed in the context of the small disturbance equation for transonic 
flow [81]. Details for general conservation laws can be found in [181]. 

4.6.3 The HLLE Solver. Another approach is to approximate the full Rie- 
mann solution by a single intermediate state bounded by two waves moving 
at speeds si and $ 2 - The wave speeds should be some approximations to the 
minimum and maximum wave speeds that would arise from this particular 
Riemann data, and the intermediate state can then be calculated by the con- 
dition of conservation. This approach is developed by Harten, Lax, and van 
Leer originally [108] and improved by Einfeldt [78]. 

4.6.4 The Roe Solver. One of the most popular Riemann solvers currently 
in use is due to Roe [195]. The idea is to determine an approximate Riemann 
solution q{x,t) by solving a constant coefficient linear system of conservation 
laws instead of the original nonlinear system. Of course the coefficient matrix 
used to define this linear system must depend on the qi and qr in order to have 
consistency with the nonlinear problem, so we will write the linear system 
for q as 

qt+ A{qi,qr)qx =0 ■ (4.35) 

This linear Riemann problem is relatively easy to solve (see Sect. 3.2), 
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We still have the problem of determining A{qiyqr) in a reasonable way. 
Roe suggested that the following conditions should be imposed on A: 

i) A{qi,qr){qr - qi) = f{qr) ~ f{qi) 

ii) A{qiyqr) is diagonalizable with real eigenvalues (4.36) 

in) A{qiyqr) ^ f\q) smoothly as qi^qr ^ q • 

Condition (4.36z) has two effects. First, it guarantees that any flux- 
difference splitting defined by the matrix will be a flux-difference splitting 
of the true flux / as well. Another effect is that, in the special case where 
qi and qr are connected by a single shock wave or contact discontinuity, the 
approximate Riemann solution agrees with the exact Riemann solution. This 
follows from the fact that the Rankine-Hugoniot condition is satisfied for qi 
and qr in this case, so 

/(9r) - J{qi) = s{qr - qi) 

for some s (the speed of the shock or contact). Combined with (4.36^), this 
shows that qr - qi must, in this situation, be an eigenvector of A with eigen- 
value s and so the approximate solution q[Xyt) also consists of this single 
jump qr — qi propagating with speed s. 

Condition (4.36u) is clearly required in order that the problem qt-\-Aqx ~ 
0 is hyperbolic and solvable. 

Condition (4. 36 in) guarantees that the method behaves reasonably on 
smooth solutions, since if HQi — Qi~i\\ = 0{h) then the linearized equation 
qt + f'iQMx = 0 is approximately valid. It is natural to require that the 
linear system (4.35) agree with the linearization in this case. Since (4.36i) 
guarantees that the method behaves reasonably on an isolated discontinuity, 
it is only when a Riemann problem has a solution with more than one strong 
shock or contact discontinuity that the approximate Riemann solution will 
differ significantly from the true Riemann solution. In practice this happens 
infrequently — near the point where two shocks collide, for example. 

One way to guarantee that both conditions (4.36n) and (4.362u) are sat- 
isfied is to take 

4(9/,9r) = /'(9ave) (4.37) 

for some average value of q, e.g., qave = \{Qi+Qr)‘ Unfortunately, this simple 
choice of gave will not give an A that satisfies (4.36 z) in general. Harten and 
Lax [107] show (see also Theorem 2.1 in [108]) that for a general system with 
an entropy function, a more complicated averaging of the Jacobian matrix in 
state space can be used. This shows that such linearizations exist, but is too 
complicated to use in practice. 

Fortunately, for special systems of equations it is possible to derive suit- 
able A matrices that are very efficient to use relative to the exact Riemann 
solution. The Roe matrix for the simple isothermal equations is given in [141]. 
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density density density 






Fig. 4.6. Computed density profiles using Roe’s approximate Riemann solver, (a) 
Sod’s shock tube problem, (b) Sod problem from a moving reference frzune so that 
the rarefru:tion wave is transonic. An entropy-violating solution is cedculated. (c) 
Same problem with an entropy-fix in the Roe solver 



Roe [195] presents the analogous formulas for the full Euler equations. Ap- 
proximate Riemann solvers of this type have been developed for many other 
applications as well. 

4.6.5 A Sonic Entropy Fix. One disadvantage of Roe’s linearization or 
the all-shock solver is that the resulting approximate Riemann solution con- 
sists only of discontinuities, with no rarefaction waves. This can lead to a 
violation of the entropy condition, particularly if the solution involves tran- 
sonic rarefactions. 

Figure 4.6(a) shows the density computed by using the Roe approximate 
Riemann solver for the Euler equations on the Sod shock tube problem of 
Fig. 2.1, where 

Qi = 3 , ui=0 , pi=3 , for X < 0.5 , . . 

Qr — 1 , Ur = 0 , Pr = I y for X > 0.5 . V * J 

The rarefaction wave is captured smoothly in this case even though the Rie- 
mann solution in each time step has only discontinuties. 

Figure 4.6(b) shows the results for a similar problem using the data 



Qi = 3 , u; = 0.9 , Pi = 3 , for X < 0.5 , 
Qj, = I ^ Ur —0.9 y pr — I y for X > 0.5 . 



(4.39) 



Changing the velocity in this way is equivalent to observing the solution to 
the Sod problem from a moving reference frame, and the solution is identical 
to that of the Sod problem except that the velocity is increased by 0.9 every- 
where. As a result the rarefaction wave is now transonic. As we can see, the 
numerical solution now contains an entropy- violating shock at x 0.5, the 
location of the discontinuity in the initial data. 

In the case of a sonic rarefaction wave, it is necessary to modify the 
approximate Riemann solver in order to obtain entropy satisfying solutions. 
There are various ways to do this. One approach, discussed by Harten and 
Hyman [105], is outlined in [141]. Various other approaches have also been 
used, e.g., [54], [112], [263]. 
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Figure 4.6(c) shows the same test problem as in Fig. 4.6(b) but with the 
entropy fix applied. Some nonsmoothness is still visible in the rarefaction 
wave, but no worse than what is seen with the true Riemann solver, it turns 
out. This results from the fact that Godunov’s method has less numerical 
viscosity near a sonic point than elsewhere, since the rarefaction wave is split 
up over the two neighboring cells (See [100]). 

4.7 High-Resolution Methods 

Regardless of what Riemann solver is used, Godunov’s method will be at best 
first-order accurate on smooth solutions and generally gives very smeared ap- 
proximations to shock waves or other discontinuities. In this section we will 
see how this method can be extended to a method that gives second-order 
accuracy on smooth flow, but which avoids nonphysical oscillations near dis- 
continuities. The key is to use a better representation of the solution, say 
piecewise linear instead of the piecewise constant representation used in Go- 
dunov’s method, but to form this reconstruction carefully by paying attention 
to how the data behaves nearby. In smooth regions the finite-difference ap- 
proximation to the slope can be used to obtain better accuracy, but near a 
discontinuity the “slope” computed by subtracting two values of Q and di- 
viding by h may be huge and meaningless. Using it blindly in a difference 
approximation will introduce oscillations into the numerical solution. 

One particular method will be developed here in a framework that can be 
interpreted as a correction phase following the solution of Riemann problems 
and construction of Godunov fluxes. Many other approaches can be found in 
the literature and a couple of these are briefly described in Sect. 4.8. 

To introduce these ideas we will first consider the scalar advection equa- 
tion 



Qt + uQx - 0 

with u > 0, in which case Godunov’s method reduces to the simple first-order 
upwind method 

After developing a high-resolution version of this method, the ideas will 
be extended to systems of equations and nonlinear problems starting in 
Sect. 4.7.10. 

4.7.1 Reconstruct— Solve— Average. There is another interpretation of 
Godunov’s method that will be useful in developing higher-order accurate 
methods of this type. Recall that g”(x,tn) denotes the piecewise constant 
function with value Qf in cell €{, We defined the numerical flux of Godunov’s 
method by advancing the solution with this data to obtain the interface value 
q^{Xy t) over the time interval [in, in+ij* Another way to describe Godunov’s 
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method is to take the advanced solution q'^{x,tn-^i) at the end of the time 
step, and average this function over grid cell Ci to obtain 

It follows from the integral form of the conservation law that this gives exactly 
the same value as the flux-diflferencing method, though implementing it in 
this form would be more difficult since q^{x, ^n+i) is not constant over Ci, and 
this integral would be difficult to evaluate directly in general. The beauty of 
the flux-differencing approach is that we do not need to evaluate this integral, 
but can find it by integrating the flux function, which is constant on the time 
interval of integration. 

But in generalizing Godunov’s method to higher-order methods, it is use- 
ful to consider what would happen if we took this approach with a differ- 
ent choice of that better approximates a smooth function. We can 

think of q^{x,tn) as a reconstruction of a function from the discrete values 
Q^, the cell averages of the function. Instead of a piecewise constant func- 
tion we might reconstruct a piecewise linear function or some other function 
q^(x,tn)^ We can then generalize Godunov’s method to an algorithm that 
takes the following general form in each time step: 

Algorithm RSA (Reconstruct-Solve— Average): 

1. Reconstruct a function q^{x, tn) defined for all x from the cell averages 

Qi- 

2. Solve the hyperbolic equation exactly (or approximately) with this initial 

data to obtain q^{x,tn~\-i) a time At later. 

3. Average this function over each grid cell to obtain 

^ . 

With a piecewise constant reconstruction we can solve the problem in 
Step 2 exactly, giving Godunov’s method. For linear systems we can solve 
this problem exactly even with more complicated initial data, such as the 
piecewise linear function considered in the next section. For nonlinear prob- 
lems we may not be able to solve the problem in Step 2 exactly, but we 
will still be able to improve the accuracy by using an approximate solution 
together with piecewise linear data. 

4.7.2 Piecewise Linear Reconstruction. To achieve better than first- 
order accuracy, we must use a better reconstruction than a piecewise con- 
stant function. From the cell averages we can construct a piecewise linear 
function of the form 

q^{x, tn) = Qi + CTiix - Xi) for Xi < x < Xi^i , (4.40) 

where 
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+ ^i+i) ~Xi + ~h (4,41) 

is the center of the z’th grid cell and is the slope on the i’th cell. The 
linear function defined by (4.40) on the z’th cell is defined in such a way 
that its value at the cell center Xi is Q^. More importantly, the average 
value of q'^ix.tn) over cell Ci is Qf (regardless of the slope af), so that the 
reconstructed function has the cell average Q^. This is crucial in developing 
conservative methods for conservation laws. Note that Steps 2 and 3 are 
conservative in general, and so Algorithm RSA is conservative provided we 
use a conservative reconstruction in Step 1, as we have in (4,40). Later we 
will see how to write such methods in the standard conservation form (4.18). 

For the scalar advection equation qt uqx = 0, we can easily solve the 
equation with this data, and compute the new cell averages as required in 
Step 3 of Algorithm RSA. We have 

g”(x,t„+i) = - uk,tn) . 

Until further notice we will assume that u > 0 and present the formulas for 
this particular case. The corresponding formulas for u < 0 should be easy to 
derive, and in Sect. 4.7.7 we will see a better way to formulate the methods 
in the general case. 

Suppose also that \uk/h\ < 1. Then it is straightforward to compute (see 
also Sect. 4.7.7) that 

Qr' = Y ~ t) 

'ifJc 1 /r 

= Q7- y(Qr - QU) 

4.7.3 Choice of Slopes. Choosing af = 0 gives Godunov’s method (the 
upwind method for the advection equation). To obtain a second-order accu- 
rate method we want to choose nonzero slopes in such a way that of approx- 
imates the derivative q^ over the z’th grid cell. Three obvious possibilities 
are: 



Centered slope: 


' 2h 


(Fromm) , 


(4.43) 


Upwind slope: 


II 

r 


(Beam- Warming) , 


(4.44) 


Downwind slope: 


- h 


(Lax-Wendroff) . 


(4.45) 



The centered slope might seem like the most natural choice to obtain second 
order accuracy, but in fact all three choices give the same formal order of 
accuracy, and it is the other two choices that give methods we have already 
derived in other ways. Only the downwind slope results in a centered 3- 
point method, and this choice gives the Lax-Wendroflf method (4.10). The 
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upwind slope gives a fully-upwinded 3-point method, which is simply Beam- 
Warming. 

The centered slope may seem the most symmetric choice at first glance, 
but due to the fact that the reconstructed function is then advected in the 
positive direction, the final updating formula turns out to be a non-symmetric 
4-point formula, which is known as Fromm’s method. 

4.7.4 Oscillations. Second-order methods such as Lax-Wendroff (or Beam- 
Warming or Fromm’s method) give oscillatory approximations to discontin- 
uous solutions. This can be easily understood using the interpretation of 
Algorithm RSA. 

Consider the Lax-Wendroff method, for example, applied to piecewise 
constant data with values 
_ f 1 if i < J 

\ 0 if i > J . 

Choosing slopes in each grid cell based on the Lax-Wendroff prescription 
(4.45) gives the piecewise linear function shown in Fig. 4.7(a). The slope af 
is nonzero only for i = J. 





Fig. 4.7. (a) Grid values Q” and reconstructed ^(‘, in) using Lax-Wendroff slopes, 
(b) After advection with ku = h/2. The dots show the new cell averages Note 

the overshoot 



The function q^{x^tn) has an overshoot with a maximum value of 3/2 
regardless of h. When we advect this profile a distance uk, and then compute 
the average over the J’th cell, we will get a value that is greater than 1 for 
any k with 0 < uk < h. The worst case is when uk = /i/2, in which case 
q^{x, in+i) is shown in Fig. 4.7(b) and = 9/8. In the next time step this 
overshoot will be accentuated, while in cell J — 1 we will now have a positive 
slope, leading to a value j that is less than 1. This oscillation then grows 
with time. 

The slopes proposed in the previous section were based on the assumption 
that the solution is smooth. Near a discontinuity there is no reason to believe 
that introducing this slope will improve the accuracy. On the contrary, if one 
of our goals is to avoid nonphysical oscillations, then in the above example we 
must set the slope to zero in the J’th cell. Any (Tj<0 will lead to > 1, 
while a positive slope wouldn’t make much sense. On the other hand we 
don’t want to set all slopes to zero all the time, or we simply have the first- 
order upwind method. Where the solution is smooth we want second order 
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accuracy. Moreover, we will see below that even near a discontinuity, once 
the solution is somewhat smeared out over more than one cell, introducing 
nonzero slopes can help keep the solution from smearing out too far, and 
hence will significantly increase the resolution and keep discontinuities fairly 
sharp, as long as care is taken to avoid oscillations. 

This suggests that we must pay attention to how the solution is behaving 
near the z’th cell in choosing our formula for af . (And hence the resulting 
updating formula will be nonlinear even for the linear advection equation!). 
Where the solution is smooth we want to choose something like the Lax- 
Wendroff slope. Near a discontinuity we may want to “limit” this slope, using 
a value that is smaller in magnitude in order to avoid oscillations. Methods 
based on this idea are known as slope- limiter methods. This approach was 
introduced by van Leer in a series of papers [246] through [248], where he 
developed the MUSCL scheme for nonlinear conservation laws (Monotonic 
Upstream-centered Scheme for Conservation Laws). The same idea in the 
context of flux limiting, reducing the magnitude of the numerical flux to 
avoid oscillations, was introduced in the flux-corrected transport (FCT) 
algorithms of Boris and Book [37]. We can view this as creating a hybrid algo- 
rithm that is second order accurate in smooth regions but which reduces to a 
more robust first-order algorithm near discontinuities. This idea of hybridiza- 
tion was also used in early work of Harten and Zwas[110]. An enormous va- 
riety of methods based on these principles have been developed in the past 
two decades. One of the algorithms of this type that is best known in the 
astrophysics community is the piecewise parabolic method (PPM) of 
Woodward and Colella[59], which uses a piecewise quadratic reconstruction, 
with appropriate limiting. 

4.7.5 Total Variation. How much should we limit the slope in a piecewise 
linear reconstruction? Ideally we would like to have a mathematical prescrip- 
tion that will allow us to use the Lax-Wendroff slope whenever possible, for 
second-order accuracy, while guaranteeing that no non-physical oscillations 
will arise. To achieve this we need a way to measure “oscillations” in the 
solution. This is provided by the notion of the total variation of a function. 
For a grid function Q we define 

oo 

TV{Q)= \Qi-Qi-i\- (4-46) 

i= — oo 

For an arbitrary function g(x) we can define 
N 

TV(q) = supX; - 9(0-1 )l , (4-47) 

j=l 

where the supremum is taken over all subdivisions of the real line — oo = 
Co < < • • ‘ < = 00 . Note that for the total variation to be finite Q or q 

must approach constant values as a: -> ±oo. 
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The true solution to the advection equation simply propagates at speed 
u with unchanged shape, so that the total variation TV{q{^, t)) must be con- 
stant in time, A numerical solution to the advection equation may not have 
constant total variation, however. E the method introduces oscillations, then 
we would expect the total variation of to increase with time. We can thus 
attempt to avoid oscillations by requiring that the method does not increase 
the total variation: 



Definition 4.1. A 2-level method is called total variation diminishing 
(TVD) if, for any set of data the values computed by the method 
satisfy 

ry(Q"+^) < TViQ'^) . (4.48) 



For a scalar conservation law, the exact solution has nonincreasing varia- 
tion and so this is a reasonable condition to impose on a numerical method. 
Harten [103] introduced the use of this criterion in developing and analyzing 
numerical methods. For a scalar equation, steps 2 and 3 of Algorithm RSA 
are TVD, and so the overall method is TVD provided that the reconstruction 
step does not increase the variation, i.e., as long as 

TV{^) < TV{Q^) . (4.49) 



4.7.6 Slope-Limiter Methods. Now let’s return to the derivation of nu- 
merical methods based on piecewise linear reconstruction, and consider how 
to “limit” the slopes so that (4.48) is satisfied. Note that setting af = 0 
works, since the piecewise constant function has the same TV as the discrete 
data. Hence the first-order upwind method is TVD for the advection equa- 
tion. Hence upwind may smear solutions but cannot introduce oscillations, a 
familiar result. 

One choice of slope that gives second-order accuracy for smooth solutions 
while still satisfying the TVD property is the minmod slope 



af = minmod 



Q7+1-Q 



h 



h 



')■ 



(4.50) 



where the minmod function of two arguments is defined by 



minmod (a, b) 



a if \a\ < |5| and a6 > 0 

b if |6| < \a\ and > 0 

0 if a6 < 0 . 



(4.51) 



Note that if a and b have the same sign then this selects the one which is 
smaller in modulus, else it returns zero. 

Rather than defining the slope on the z’th cell by always using the down- 
wind difference (which would give Lax-Wendroff ) , or by always using the 
upwind difference (which would give Beam- Warming) , the minmod method 
compares the two slopes and chooses the one which is smaller in magnitude. 
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upwind Lax-Wendroff 







Fig. 4.8. Tests on the advection equation with different limiters. All results are at 
time t = ly after one revolution with periodic boundary conditions 



If the two slopes have different sign, then the value Qf must be a local maxi- 
mum or minimum, and it is ea^y to check in this case that we must set ^ 
in order to satisfy (4.49). 

Figure 4.8 shows a comparison of the upwind, Lax-Wendroff, and min- 
mod methods for an advection problem with initial data consisting of both a 
smooth hump and a square wave. The advection velocity is u = 1 and periodic 
boundary conditions are used on [0, 1] so that at integer times ^ = 0, 1, 2, ... 
the solution should agree with the initial data. The figure shows solutions at 
t — I on a. grid with h = 0.01 and k = 0.005 (so the Courant number is 0.5 
— better results with all methods would be obtained with a Courant number 
closer to 1). We see that the minmod method does a fairly good job of main- 
taining good accuracy in the smooth hump and also sharp discontinuities in 
the square wave, with no oscillations. 

Sharper resolution of discontinuities can be achieved with other limiters 
that do not reduce the slope as severely as minmod near a discontinuity. One 
popular choice is the monotonized central-difference limiter (MC- 
limiter), which was proposed by van Leer [247]: 



erf = minmod 



Q?+i - Q?-i 

2h 
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This compares the central-difference of Fromm’s method with twice the one- 
sided slope to either side. In smooth regions this reduces to the centered slope 
of Fromm’s method but near discontinuities it gives sharper resolution than 
minmod while remaining TVD. 



4.7.7 Flux Formulation with Piecewise Linears. The slope-limiter 
methods described above can be written as flux-differencing methods of the 
form (4.18). The updating formulas derived above can be manipulated alge- 
braically to determine what the numerical flux function must be. Alterna- 
tively, we can derive the numerical flux by computing the exact flux through 
the interface X{ using the piecewise linear solution by integrating 

uq'^{xiyt) in time from to For the advection equation this is easy to 
do and we find that 



1 

= kl 



t) dt 



= uQi_i -f -u(h - ku)af_^ . 

Using this in the flux-differencing formula (4.18) gives 



= gr - ^(gr - qu ) - “(/» - mk” - , 

which agrees with (4.42). 

If we also consider the case u < 0, then we will find that in general the 
numerical flux for a slope-limiter method is 



f '^Qi-i + — ku)a^_^ if u > 0 

\ uQf - |u(/i -f A:u)af if u < 0 , 



(4.52) 



where erf is the slope in the z’th cell Cj, chosen by one of the formulas discussed 
previously. 

Rather than associating a slope erf with the i’th cell, the idea of writing 
the method in terms of fluxes between cells suggests that we should instead 
associate our approximation to with the cell interface at Xi where Ff is 
defined. Across the interface Xi we have a jump 



^Q? = Q7-Qti 



(4.53) 



and this jump divided by h gives an approximation to This suggests that 
we write the flux (4.52) as 



Fr = u-Q? + u+gr_i 





(4.54) 



where are defined in (4.29). If 5f is the jump AQf itself then this gives the 
Lax-Wendroff method. We see that the Lax-Wendroff flux can be interpreted 
as a modification to the upwind flux (4.28). This observation is crucial in the 
development of high-resolution methods. 
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4.7.8 Flux Limiters, Prom the above discussion it is natural to view Lax- 
Wendroff as the basic second-order method based on piecewise linear recon- 
struction. Other second-order methods have fluxes of the form (4.54) with 
different choices of 6f. The slope-limiter methods can then be reinterpreted 
as flux-limiter methods by choosing Sf to be a limited version of (4.53). 
In general we will set 

<5r = m)AQ2 , (4.55) 



where 



(4.56) 



The index I here is used to represent the interface on the upwind side of X{: 



( i — I ifw>0 
ifu<0. 



(4.57) 



The ratio Of can be thought of as a measure of the smoothness of the data 
near X{. Where the data is smooth we expect Of ^ 1 (except at extrema). 
Near a discontinuity we expect that Of may be far from 1. 

The function </>(0) is the flux-limiter function, whose value depends on 
the smoothness. Setting (f){0) = 1 for all 0 gives the Lax-Wendroff method, 
while setting <j){0) = 0 gives upwind. More generally we might want to devise 
a limiter function (j) that has values near 1 for 0 near 1, but which reduces 
(or perhaps increases) the slope where the data is not smooth. 

There are many other ways one might choose to measure the smoothness 
of the data besides the variable 0 deflned in (4.56). However, the framework 
proposed above results in very simple formulas for the 0 function correspond- 
ing to many standard methods, including all the methods discussed so far. 

In particular, note the very nice feature that choosing 



m = e 



(4.58) 



results in (4.55) becoming 

K = (^) ^< 3 ? = 40 ? ■ 

Hence this choice results in the jump at the interface upwind from Xi being 
used to define 6f instead of the jump at this interface. As a result, the method 
(4.54) with the choice of “limiter” (4.58) reduces to the Beam- Warming 
method. 

Since the centered difference (4.43) is the average of the one-sided slopes 
(4.44) and (4.45), we also find that Fromm’s method can be obtained by 
choosing 



(4.59) 
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Also note that = 2 corresponds to using Sf = 2AQf^ i.e., twice the 
jump at this interface, while <l>{6) = 20 results in using twice the jump at the 
upwind interface. Recall that these are necessary ingredients in some of the 
slope limiters discussed in Sect. 4.7.6. 

Translating the various slope limiters into flux-limiter functions, we find 
the following expressions for some standard methods: 



Linear methods: 
upwind : 
Lax-Wendroff : 
Beam- Warming : 
Fromm : 



m=o 

(t>{e) = 1 

m = 0 



High-resolution 
limiters: 
minmod : 
superbee : 
MC: 
van Leer : 



(4.60) 



(j){e) 

m 

(t>{e) 

m 



minmod (1,0) 

max(0, min(l,20), min(2,0)) 

max(0, min((H-0)/2, 2, 20)) 
0 -\-\e\ 

T+M ' 



A wide variety of other limiters have also been proposed in the literature. 

4.7.9 TVD Limiters. For simple limiters such as minmod, it is clear from 
the derivation as a slope-limiter (Sect. 4.7.6) that the resulting method is 
TVD, since it is easy to check that (4,49) is satisfied. For more complicated 
limiters we would like to have an algebraic proof that the resulting method 
is TVD. A fundamental tool in this direction is a theorem of Harten[103], 
which can be used to derive explicit algebraic conditions on the function 
(f> required for a TVD method. Sweby[234] derived explicit constraints for 
limiter functions and shows that we require (see also [141]): 

0 < ^(^) ^ minmod(2, 20) . (4-61) 

This defines the TVD region in the 0-<f) plane: the curve (f){6) must lie in this 
region, which is shown as the shaded region in Fig. 4.9(a). This figure also 
shows the functions (j){9) from (4.60) for the Lax-Wendroff, Beam- Warming, 
and Fromm methods. All of these functions lie outside the TVD region for 
some values of 0, and these methods are not TVD. This graphical analysis of 
(p was first presented by Sweby [234], who analyzed a wide class of flux-limiter 
methods. 

Note that for any second-order accurate method we must have 0(1) — 1. 
Sweby found, moreover, that it is best to take 0 to be a convex combina- 
tion of 0 = 1 (Lax-Wendroff) and 0 — 0 (Beam- Warming). Other choices 
apparently give too much compression, and smooth data such as a sine wave 
tends to turn into a square wave as time evolves. Imposing this additional 
restriction gives the ‘‘second order TVD” region of Sweby which is shown in 
Fig. 4.9(b). 




74 



R.J. LeVeque 




Fig. 4.9. Limiter functions (a) The shaded region shows where function 

values must lie for the method to be TVD. The second-order linear methods have 
functions (f>{0) that leave this region, (b) The shaded region is the Sweby region of 
second-order TVD methods. The minmod limiter lies along the lower boundary, (c) 
The superbee limiter lies along the upper boundary, (d) The MC limiter is smooth 
at </> = 1 



The “high-resolution” limiter functions from (4.60) are all seen to satisfy 
the constraints (4.61), and these limiters all give TVD methods. The (p func- 
tions are graphed in Fig. 4.9. Note that minmod lies along the lower boundary 
of the Sweby region while superbee lies along the upper boundary. The fact 
that these functions are not smooth at ^ — 1 corresponds to the fact that 
there is a switch in the choice of one-sided approximation used as 6 crosses 
this point. This can lead to a loss in accuracy near inflection points. For full 
second order accuracy we would like the function (p to be smooth near 0 = 1, 
as for the MC limiter. The van Leer limiter is an even smoother version of 
this. 

4.7.10 Linear Systems. The slope-limiter or flux-limiter methods can 
also be extended to systems of equations. This is most easily done in the flux- 
limiter framework, and will be illustrated first for the linear system Aga; = 
0 . 

We can write A = A~^ -h A“, where A = RAR~^ is decomposed based 
on the sign of each eigenvalue: A~^ = RA~^R~^ and A~ ~ RA~ R~^ where 
A"^, for example, has the positive part of each A on the diagonal, so negative 
eigenvalues are replaced by zero. Using this, Godunov’s method for a linear 
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system (the generalization of the upwind method) can be written in terms of 
the flux function 



F{Qi-i , Qi) = {A+Qi-x + A-Qi) . (4.62) 

The Lax-Wendroff method (4.10) can also be written in flux-differencing 
form (4.18) if we define the flux by 

F{Qi-uQi) = \A{Qi-i VQi) + . (4.63) 

We can rewrite this as 

F{Qi-uQi) = (A+Qi_i + A-Q0 + i|^| (l - ^Ml) {Qi-Qi-i) , (4.64) 
where |^| = — A~ . 

In the form (4.64), we see that the Lax-Wendroff flux can be viewed as 
being composed of the Godunov flux (4.62) plus a correction term, just as 
for the scalar advection equation. To define a flux-limiter method we must 
limit the magnitude of this correction term based on how the data is varying. 
But for a system of equations, AQi — Qi - is a vector and it is not 
so clear how to compare this vector with the neighboring jump AQi^i or 
AQi^i to generalize (4.55), nor which neighboring jump to consider, since 
the “upwind” direction is different for each eigen-component. The solution, of 
course, is that we must decompose the correction term in (4.64) into eigen- 
components and limit each scalar eigen-coefficient separately based on the 
algorithm for scalar advection. 

We can rewrite the correction term as 

^1^1 (l - ^l^l) (Qi - Qi-x) = \\A\ (l - ^l^l) , 

where 

m 

{Qi-Qi-i) = Y^a^y 

i=l 

gives the decomposition of the jump in Q across this interface into eigenvec- 
tors of A, i.e., the solution of the Riemann problem. 

The flux-limiter method is defined by replacing the scalar coefficient af 
by a limited version, based on the scalar formulas of Sect. 4.7.8. We set 

= af^(0f ) , (4.65) 



where 




with 




i- 1 
i -H 1 



if > 0 
if AP < 0 



(4.66) 



and (j) is one of the limiter functions of Sect. 4.7.8. The flux function for the 
flux-limiter method is then 




76 R.J. LeVeque 



Fi — A^Qi-i -i- A Qi Fi , (4.67) 

where the first term is the upwind flux and the correction flux F^ is defined 
by 

Fi = l\A\(^l-'^\A\^'£ay. (4.68) 

Note that in the case of a scalar equation, we can take = 1 as the eigenvec- 
tor of A = u, so that AQi = aj which is what we called 6i in Sect. 4.7.8, The 
formula (4.67) then reduces to (4.54). For a system of equations, the method 
just presented can also be obtained by diagonalizing the linear system and 
applying the scalar flux limiter method to each resulting advection equa- 
tion. This is what we are doing at each cell interface by solving the Riemann 
problem. 

Also note that the flux Fi depends not only on Qi-i and Qi, but also on 
and Qi^i in general, because neighboring jumps are used in defining 
the limited values df in (4.68). The flux-limiter method thus has a 5-point 
stencil rather than the 3-point stencil of Lax-Wendroff. This is particularly 
important in specifying boundary conditions (see Sect. 4.9). 

4.7.11 Implementation. For the constant coefficient linear system, we 
could compute the matrices A”, and \A\ once and for all and compute 
the fluxes directly from the formulas given above. However, with limiters we 
must solve the Riemann problem at each interface to obtain a decomposition 
of AQi into waves and wave speeds and these can be used directly in 
the computation of without ever forming the matrices. This approach 

also generalizes directly to nonlinear systems of conservation laws, where we 
do not have a single matrix A but can still solve a Riemann problem at each 
interface for waves and wave speeds. This generalization is discussed briefly 
in the next section. 

To accomplish this most easily, note that if we use the flux (4.67) in the 
flux-differencing formula (4.18) and then rearrange the upwind terms, we can 
write the formula for as 

=Q7- - liFi+i - Fi) , (4.69) 

where Fi is given by (4.68). Here we drop the superscript n from the current 
time step since we will need to use superscript p below to denote the wave 
family. Each of the terms in this expression can be written in terms of the 
waves and wave speeds A^: 

m 

A+AQi = ^(AP)+afrP, (4.70) 

p=l 
m 

= X^(A")-afrP , 

p=i 



A- AQi 



(4.71) 
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and 

P=1 ^ ' 

Once we have solved the Riemann problem for the waves limited 

waves a^rP, and speeds A^, we can compute everything needed for the high- 
resolution method (4.69). 

4.7.12 Extension to Nonlinear Systems. This approach is now easily 
extended to nonlinear hyperbolic systems. Given states Qi-\ and Qi, the 
solution to the Riemann problem will yield a set of waves Wf G IR”^ and 
speeds Af G IR, analogous to the the linear problem, though now the speeds 
will vary with i and so will the directions of the vectors Wf in phase space; 
they will no longer ail be scalar multiples of a single set of eigenvectors 
rP. The quantities AQi and AQi are generalized to flux differences 
denoted by A'^Aqi and Aqi^ with the property that 

A-Aqi + A+Aqi = f{Qi) - . (4.72) 

Note that for the linear case f{Qi) - = AAQi and this property is 

satisfied. In general we can think of setting 

m 

A-Aqi = (4.73) 

P=l 

m 

A+Aqi = X^(Af)+Wf, (4.74) 

p=l 

a direct extension of (4.70). There are, however, some issues concerned with 
rarefaction waves and entropy conditions that make the nonlinear case more 
subtle, see [144] for a more complete discussion of this approach. 

Once the waves, speeds, and fiux differences have been suitably defined, 
the algorithm is virtually identical with what has already been defined in the 
linear case. We set 

= Q? - - Fi) , (4.75) 

where the first term corresponds to the Godunov method and the second 
term gives second-order corrections, with 

^i = ^ElAl’|(l-^|Af|)wf. (4.76) 

Here VVf represents a limited version of the wave >Vf , obtained by comparing 
this jump with the jump Wf in the same family at the neighboring Riemann 
problem in the upwind direction, so 

r i - 1 if Af > 0 

I ^ + 1 if Af < 0 . 
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This limiting procedure is slightly more complicated than in the constant 
coefficient case since Wf and W/ are in general no longer scalar multiples 
of the same vector r^. So we cannot simply apply the scalar limiter function 
(f){9) to the ratio of these scalar coeflBcients as in the constant coefficient 
linear case. Instead we can, for example, project the vector Wj onto Wf and 
compare the length of this projection with the length of Wf . 

4.8 Other Approaches 

One approach to extending Godunov’s method to a high-resolution method 
has been outlined above. Other approaches are also commonly used. One 
variant of the above method, often called the MUSCL approach following 
van Leer [248] , consists of computing the flux 

by solving the Riemann problem using modified data and 

and then setting to be the flux along x/t = 0 in this Riemann solu- 
tion. The data and are constructed as approximations to 

g(xi,f„+i/ 2 ) at the midpoint in time, approaching the interface from the 
left and right, respectively. These are generally based on piecewise linear (or 
piecewise quadratic, in the case of PPM) reconstructions of the solution in 
each grid cell, together with characteristic extrapolation forward in time. 
Limiters are typically used in this reconstruction phase to avoid oscillations. 
Because this flux approximation is centered in time, and based on at least a 
linear approximation in space, such a method can be second-order accurate 
for smooth solutions. 

It is hard to achieve better than second-order accuracy with the ap- 
proaches discussed so far. For problems with discontinuous solutions this 
may not matter. Accuracy is degraded near the discontinuity anyway, and 
even smooth components of the flow may lose accuracy after interacting with 
shock wave in a nonlinear problem (see, e.g., [72]). High- resolution methods 
with at best second-order accuracy are frequently used in practice. However, 
for some problems where there is considerable small-scale structure that is 
smoothly varying, it may be desirable to use methods with a higher formal 
order of accuracy. The semi-discrete approach described next is then a good 
approach. 

4.8.1 Semi-discrete Methods. Rather than attempting to discretize si- 
multaneously in space and time, another approach is to first discretize the 
PDE in space alone, obtaining a system of ODE’s for the evolution of cell 
averages based on instantaneous values of the flux through each side. Then 
this ordinary differential equation is discretized in time by an accurate ODE 
method, typically a Runge-Kutta method. 
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Let Qi{t) be the z’th cell average at time for the moment assumed to 
be defined for all values of L Then the integral form of the conservation law 
applied on the z’th cell gives 

= (4.77) 

where Fi{t) ^ is the flux through the left cell edge at this instant 

in time, for example. Suppose we have some formula for this flux in terms of 
the cell average, say 

(In practice it may depend on more than just the two neighboring values.) 
Then we have the semi-discrete method 

Q'iit) = Qi+i) - m-i. a")] , (4-78) 

which has been discretized in space, but not yet in time. 

If we discretize (4.78) by the forward Euler method for ODE’s, we would 
obtain 

= Q?~ l[HQ?,Q?+i)-nQ?-uQ7)} • 

This has the same form as (4.20). However, if we now view ^(Q?_i,Q”) 
as an approximation to the flux at time tn, then this will only be first-order 
accurate in time, no matter how accurate the spatial discretization. We previ- 
ously achieved second-order accuracy by determining F so that 
approximates the average flux over or the pointwise value of the 

flux at in-fi/ 2 - That approach is hard to extend to higher order since it re- 
quires using more terms in a Taylor series expansion, involving mixed x-t 
derivatives. 

The advantage of the semi-discrete approach is that achieving higher ac- 
curacy in space and time are decoupled processes. Instead of forward Euler we 
could apply a higher-order Runge-Kutta method to (4.78). These are multi- 
stage methods that require evaluating F at one or more intermediate times 
within each time step, but if F produces a sufficiently accurate instanta- 
neous flux based on any given set of cell averages, then arbitrarily high-order 
accuracy can be obtained. Special TVD Runge-Kutta methods have been 
developed for use in the context of semi-discrete methods for conservation 
laws [216]. 

A flux function F delivering high-order accuracy in space can be ob- 
tained for smooth solutions by using higher-order interpolating polynomials. 
One popular class of methods are the ENO schemes (Essentially Non- 
Oscillatory) which adaptively choose stencils for high-order interpolation 
based on the behavior of the solution in order to avoid interpolating across 
a discontinuity. Some references on ENO schemes, and more recent work on 
WENO (weighted ENO), include [104], [109], [119], [155], [215], [216]. 
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4.8.2 Flux- Vector Splitting. We now return to the question of defining a 
first-order accurate interface flux and consider an alternative to the Godunov 
method presented above, but which has many similarities and in particular 
is identical for linear systems. The Godunov method can be interpreted as 
a flux-difference splitting method, since according to (4.72) the differ- 
ence f{Qi) — f{Qi-i) is split into a left-going portion A~ Aqi which modifies 
Qi-i and a right-going portion which modifies Qi. This splitting is 

accomplished by solving the Riemann problem between Qi~i and Qi. 

An alternative is to split each flux /(Qi), evaluated at the cell average, 
into a left-going part /“ and a right-going part so we have 

f{Qi) = /r + /^ • 

We can then define the interface flux 

F, = fl, + f- 

based on the portion of each cell-centered flux approaching the interface. A 
method of this form is called a flux-vector splitting method, since it is 
the flux vector /(Qi) which is split instead of the flux difference. 

One possibility for the Euler equations is based on the observation that 
these equations have the special property that f{aq) = oif{q) for any scalar 
a and are said to be homogeneous of degree 1. From this it follows that 

/(«) = f {<!)<} 

for any state g, which is not true for nonlinear functions that do not have this 
homogeneity. Hence if At is the Jacobian matrix /'(Qi) (or some approxima- 
tion to it), then a natural flux- vector splitting is given by 

/r = ^7Qi . ft = ^tQ^ • 

We thus have 

Fi = Af_,Qi-i+A-Qi. 

For a constant coefficient linear problem (A{ = A), this agrees with (4.62) 
and this flux-vector splitting is identical to the flux-difference splitting of 
Godunov’s method. For a nonlinear problem they are typically different. 

The above splitting for the Euler equations is called Steger- Warming 
flux- vector splitting [227] in the aerodynamics community. An equivalent 
method, known as the beam scheme, was introduced earlier in astrophysics 
[206] based on a different viewpoint; each state Qi is decomposed into distinct 
beams of particles traveling at the different wave speeds. 

For transonic flow problems in aerodynamics, the flux- vector splitting 
given above suffers from the fact that the splitting does not behave smoothly 
as the Mach number passes through 1 (where the u — c or u -h c characteristic 
speed changes sign). A smoother flux- vector splitting was introduced by van 
Leer [249], and other variants and improvements have since been introduced, 
e.g., [152], [153]. 
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The beam scheme was motivated by considering the collisionless Boltzman 
equation as a model of gas dynamics, as discussed in [108]. More recently 
the BGK approximation to the collision term, due to Bhatnagar, Gross, and 
Krook[31], has been used to obtain more accurate Boltzman schemes. BGK 
methods have the advantage of not requiring Riemann solutions as such, and 
may be particularly useful for low density gases. Prendergast and Xu [189] 
give a nice introduction to this approach. 

4.8.3 The Marquina Flux. A flux- vector splitting recently introduced by 
Marquina [73], [157] appears to be very robust on difficult astrophysical com- 
putations. Let rf and X^{Qi) be the right eigenvectors and eigenvalues of 
/'(Qi), respectively, and decompose both Qi and f{Qi) into these eigenvec- 
tors: 

m m 

HQi) = E ’ ^« = E ■ (4.79) 

p=l p^l 

Let 



<5f = sgn(A'’(Q0) , 



AP = max \\P{q)\ . 
g=Qi~i,Qi 



Then set 

<Ar = 



0 if AP(Qi)>0andAP(<3i_i)>0 

if AP(Qi) < 0 and AP(Qi_i) < 0 
- APi4) if A^CgOA^Wi-i) < 0 , 



= I Cl 

The Marquina flux is then given by 



if AP(Qi) < 0 and AP(Qi_i) < 0 
if AP(Qi) > 0 and AP(Qi_i) > 0 
if AP(Qi))iP(Qi.i) < 0 . 



fr = E[*Pi’'S'-.+*r’-n 

p=l 



(4.80) 



If X^(Qi) and A^(Qi-i) have the s£une sign, then this just reduces to the 
Steger- Warming splitting. When the eigenvalue changes sign, this introduces 
some additional dissipation into the flux in a manner motivated by the highly- 
diffusive Lax-Priedrichs method [157]. In the case X^(Qi-i) < 0 < A^(Qt) 
this dissipation enforces the entropy condition. When X^(Qi~i) > 0 > X^(Qi) 
there is a shock rather than a rarefaction wave, but this additional dissipation 
can still be useful in certain circumstances such as slowly-moving shocks (see 
Sect. 7.5). 

Marquina used this flux function in conjunction with an ENO-based semi- 
discrete method and Runge-Kutta time stepping (see Sect. 4.8.1) to achieve 
higher-order accuracy. The flux function can also be used in conjunction with 
the high-resolution method developed in Sect. 4.7.12 by using Fi to define a 
flux-difference splitting via 
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A Aqi = Fi- f{Qi-i) , 

A-^Aqi - f(Qi)~Fi, 

Any numerical flux definition can be rewritten as a flux-difference splitting by 
this device. To apply the high-resolution method we must also define waves 
yVf and speeds for use in the second-order correction terms of (4.76). Prom 
(4.79) we have 

m 

Qi - Qi~i = J , 

p=l 

which suggests defining the p’th wave Wf as 

' t t 1—1 

The corresponding speed might be defined as 

This solver has been implemented in CLAWPACK by D. Bale and used for 
some of the relativistic calculations presented in Sect. 9. It was found to work 
well in cases where other solvers failed. 

4.9 Boundary Conditions 

So far we have only studied methods for updating the cell average as- 
suming that we have neighboring cell values and and perhaps 

values further away as needed in order to compute the fluxes and F/^j. 
In practice we must always compute on some finite set of grid cells covering a 
bounded domain, and in the first and last cells we will not have the required 
neighboring information. Instead we have some set of physical boundary con- 
ditions that must be used in updating these cell values. One approach is to 
develop special formulas for use near the boundaries, which will depend both 
on what type of boundary condition is specified and on what sort of method 
we are trying to match. However, in general it is much easier to think of ex- 
tending the computational domain to include a few additional cells on either 
end, called ghost cells, whose values are set at the beginning of each time 
step in some manner that depends on the boundary conditions and perhaps 
the interior solution. Then these values provide the neighboring cell values 
needed in updating the cells near the physical domain. The updating formula 
is then exactly the same in all cells, and there is no need to develop a special 
flux-limiter method, say, that works with boundary data instead of initial 
data. Instead the boundary conditions must be used in deciding how to set 
the values of the ghost cells, but this can generally be done in a way that 
depends only on the boundary conditions and is decoupled entirely from the 
choice of numerical method that is then applied. 
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Suppose the problem is on the physical domain [a, 6], which is subdivided 
into cells Ci, C 2 , • • C;v with x\ = a and xn-\-x = so that h = {b- o)/7V. 
If we use a method for which F{ depends only on Qi-i and Qi, then we need 
only one ghost cell on either end. The ghost cell Co = [a — h,a) allows us to 
calculate the flux Fi at the left boundary while the ghost cell = [b, 6+h) 
is used to calculate at x — 6. With a flux-limiter method of the type 

developed above, we will generally need two ghost cells at each boundary 
since, for example, the jump Qo - Q-i will be needed in limiting the flux 
correction in F\ . For a method with an even wider stencil, additional ghost 
cells would be needed. 

We will refer to the solution in the original domain [a, 6] as the interior 
solution, which is computed in each time step by the numerical method. At 
the start of each time step we have the interior values Qi, . . . , obtained 
from the previous time step (or from the initial conditions if n = 0), and we 
apply a boundary condition procedure to fill the ghost cells with values 
Qo^ Qat +15 before applying the method on the next time step. We will 
look at several examples to see how the ghost cell values might be set in order 
to implement various physical boundary condtions. 

4.9.1 Periodic Boundary Conditions. Periodic boundary conditions of 
the form q{a, t) = q{by t) are very easy to impose with any numerical method. 
In updating Qi we need values Qo to the left and Q 2 to the right (for a 3- 
point method). By periodicity the value Qo should agree with the value Qat 
in the last cell. One could code the formula for updating Qi separately to use 
Qiv in place of the value Qf_i that would normally be used for i > 1, but 
it is simpler to use the ghost-cell approach and simply set Qq = before 
computing fluxes and updating the cell values, so that the same formula can 
then be used everywhere. With a 5-point stencil we need to fill two ghost 
cells at each boundary, and we set 

Qo” = Qn> Qn+1=Qi, Qn+2=Q2 (4.81) 

at the start of each time step. 

4.9.2 Outflow Boundaries. Often we have artificial computational bound- 
aries that arise simply because we can only solve the problem on a bounded 
domain. At such boundaries we often want to have no incoming signal, while 
there may be out-going waves that should leave the domain cleanly with- 
out generating spurious reflections at the artificial boundary. We thus want 
nonreflecting boundary conditions. At such boundaries we can often set 
ghost cell values by extrapolation from the interior solution. If the ghost 
cell value QJv+i is set based on Qat-u •••5 then the new value 

will effectively be computed on the basis of values to the left alone, even if 
the formula depends on Q5v-i-i> ^^d hence this reduces to some sort of up- 
wind method. The simplest approach is to use a zero-order extrapolation, 
meaning extrapolation by a constant function. We simply set 

Qn-\-i — Qn > Qn-x-2 = Qn (4.82) 
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at the start of each time step. The idea of extrapolation at outflow boundaries 
turns out to be extremely powerful in conjunction with methods based on 
solving Riemann problems. If there is no jump in the values at the boundary, 
there are no waves in the Riemann solution and in particular no incoming 
waves. 

4.9.3 Solid Walls and Symmetry. Now suppose x = 0 is a line of symme- 
try, or a solid wall. Both can be handled by the same procedure for the Euler 
equations. If the flow is initially symmetric about a; = 0, meaning that g and 
p are even functions while the velocity u is odd, then this will be maintained 
at later times. If we are solving the equations only for a; > 0, then we can 
specify values in the ghost cells according to 

Q? - = nq ^ , (4.83) 

where TZ represents the reflection operator which simply negates the velocity 
while preserving the mass and pressure. These are the values we would find 
in these cells if we performed the full calculation with symmetric initial data, 
and so setting the ghost cell values in this manner gives an identical solution 
within the domain a: > 0. 

Since velocity is an odd function, the velocity will be zero right at the 
point a: = 0. For this reason the numerical boundary conditions (4.83) are 
also correct for a solid wall, where u = 0 is the only physical boundary 
condition to be imposed. Note that when solving the Riemann problem with 
this symmetric data, the resulting symmetric similarity solution will have a 
1-wave and 3-wave which are symmetric, and a contact discontinuity which 
must travel at speed zero. Hence the velocity between the wall and the 3- wave 
moving into the computational domain will be zero, as we expect physically. 
The in-going wave will be shock if the velocity in the first cell was negative 
(flow into the wall) or a rarefaction wave if that velocity was positive (flow 
away from the wall). 



5. Source Terms and Fractional Steps 

Many “conservation laws” involve source terms, as discussed in Sect. 2.3. 

There are various ways to handle these terms, which fall into two basic cat- 
egories: 

- Unsplit methods, in which a single finite-difference formula is developed to 
advance the full equation over one time step. 

— Fractional step (splitting) methods, in which the problem is broken 
down into pieces corresponding to the different processes, and a numer- 
ical method appropriate for each separate piece is applied independently. 
This approach is also often used to split multi-dimensional problems into a 
sequence of one-dimensional problems {dimensional splitting, as discussed 
in Sect. 6.). 
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To illustrate, we begin with a simple advection-reaction equation of the 
form 

qt + uQx = -Ag , (5.1) 

with data g(a:,0) = q^{x). This would model, for example, the transport of 
a radioactive material in a fluid flowing at constant speed u down a pipe. 
The material decays as it flows along, at rate A. We can easily compute the 
exact solution of (5.1), since along the characteristic dx/dt = u we have 
dq/dt = -Ag, and hence 

q{x^t) = e~^^q^{x — ut) . (5.2) 

5.1 Unsplit Methods 

It is easy to develop unsplit methods for (5.1). For example, an obvious 
extension of the upwind method for advection would be (assuming u > 0), 

= Q?~ y (Qr - Q?-i) - kXQ? . (5.3) 

This method is first-order accurate and stable for 0 < uk/h < 1. 

A second order Lax-Wendroff style method can be developed by using 
the Taylor series 

q{x,t + k) q{x,t) + kqt{x,t) + ^k'^qtt{x,t) . (5.4) 

As in the derivation of Lax-Wendroff, we must compute qtt from the PDE. 
Differentiating qt gives 

qtt “ '^qxt ^qt ? qtx ~ '^Qxx A^x ? 
and combining these, we obtain 

qtt = u^qxx + 2tiA^x + A^^ . (5.5) 

Using this expression for qu in (5.4) gives 

q{x, t-\-k) ^ q - k{uqx -f Xq) -f \k^{u^qxx + 2uAgx + X^q) . . 

= [l- kX+ q~ ku{l - |A:A) q^ + ^k'^u^qxx 

We can now approximate x-derivatives by finite differences to obtain the 
second-order method 

= (l_A:A+iA2A2)Qr-§(l-^fcA)(Qr+i-Qr-i)(5.7) 
+ ^(Qr-i-2Q”+Qr+i)- 

Note that in order to correctly model the equation (5.1) to second order 
accuracy, we must properly model the interaction between the uqx and the 
Xq terms, which brings in the mixed term — Taylor series 

expansion. 
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5.2 Fractional Step Methods 

A fractional step method for (5.1) is applied by first splitting the equation 
into two subproblems which can be solved independently. For the advection- 
reaction problem (5.1) we might take these to be: 

Problem A: qt + uq^ = 0 , (5.8) 

Problem B: qt — ~Xq . (5.9) 

The idea with the fractional step method is to combine these by applying 
the two methods in an alternating manner. This has great advantage over at- 
tempting to deriv6 an unsplit method, especially for more complicated prob- 
lems. If we split the general problem qt + f{q)x = t^(^) into the homogeneous 
conservation law and a simple ODE, then we can use standard methods for 
each. In particular, the high-resolution shock-capturing methods can be used 
directly for the homogeneous conservation law, whereas trying to develop an 
unsplit method based on the same ideas while incorporating the source term 
directly can be more difficult. 

As a simple example of the fractional step procedure, suppose we use the 
upwind method for the A-step and forward Euler for the ODE in the B-step 
for the advection-reaction problem. Then the simplest fractional step method 
over one time step would consist of the following 2 stages: 

A-step: Q* ^Qi - - Qi-i) . (5-10) 

B-step: = Q* - kXQ* . (5.11) 

Note that we first take a time step of length k with upwind, starting with 
initial data to obtain the intermediate value Q * . Then we talce a time step 
of length k using forward Euler, starting with the data Q* obtained from the 
first stage. 

It may seem that we have advanced the solution by time 2k after taking 
these two steps of length k. However, in each stage we used only some of the 
terms in the original PDE, and the two stages combined give a consistent 
approximation to solving the original PDE (5.1) over a single time step of 
length k. 

To check this consistency, we can combine the two stages by eliminating 
Q* to obtain a method in a more familiar form: 

=(l-fcA)Q‘ 

= (l-fcA)[Q”-u|(Q"-(?"_i)] (5.12) 

^Qf- u^iQf - Q"_i) - kXQf + - Q”_i) . 

The first three terms on the right-hand side agree with the unsplit method 
(5.3). The final term is 0{k‘^) (since (Q? - Qi_i)/h ^ q^ = 0(1)) and so 
a local truncation error analysis will show that this method, though slightly 
different from (5.3), is also consistent and first-order accurate on the original 
equation (5.1). 
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A natural question is whether we could improve the accuracy by using 
a more accurate method in each step. For example, suppose we use Lax- 
WendrofF in the A-step and the trapezoidal method, or the 2-stage Runge- 
Kutta method, in the B-step. Would we then obtain a second order accurate 
method for the original equation? For this particular equation, the answer 
is yes. In fact if we use p’th order accurate methods for each step, the re- 
sult will be a p’th order accurate method for the full original equation. But 
this equation is very speci 2 d in this regard. In general one would think that 
splitting the equation into pieces in this manner would introduce some error 
that depends on the size of the time step k and is indepenendent of how well 
we then approximate the subproblem in each step. In general this is true — 
there is a “splitting error” that in general would be 0{k) for the type of 
splitting used above, and so the resulting fractional step method will be only 
first-order accurate, no matter how well we then approximate each step. This 
will be analyzed in more detail below. 

For the case of equation (5.1) there is no splitting error. This follows from 
the observation that we can solve (5.1) over any time period k by first solving 
equation (5.8) over time fc, and then using the result as data to solve the 
equation (5.9) over time k. Physically we can interpret this as follows. Think 
of the original equation as modeling a radioactive tracer that is advecting 
with constant speed u (carried along in a fluid, say) and also decaying with 
rate A. Since the decay properties are independent of the position x, we can 
think of first advecting the tracer over time k without allowing any decay, 
and then holding the fluid and tracer stationary while we allow it to decay 
for time k. We will get the same result, and this is what we have done in the 
fractional step method. 

On the other hand, suppose we modify the equation slightly so that the 
decay rate A depends on x, 

Qt + UQx ~ “A(x)g . (5.13) 

Then our previous argument for the lack of a splitting error breaks down 
— advecting the tracer a distance uk and then allowing it to decay, with 
rates given by the values of A at the final positions, will not in general give 
the same result as if the decays occurs continuously as it advects, using the 
instantaneous rate given by A(x) at each point passed. In this case there 
would be a nonzero splitting error. 

5.3 General Formulation of Fractional Step Methods 

Consider a more general PDE of the form 

qt = {A^B)q, (5.14) 

where A and B may be differential operators, e.g., A — -udx and B = A(x) 
in the previous example. For simplicity suppose that A and B do not depend 
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explicitly on t, e.g;., A(x) is a function of x but not of t. Then we can compute 
that 

(Rt = (-4 T B)qt = (-4 + B)‘^q , 
and in general 

diq = {A + Byq. 

We can then write the solution at time k using Taylor series as 

q{x, k) = q{x, 0) + + B)q{x, 0) + ^k^{A + BYq{x, 0) H 

= (/ + k{A + B) + \k\A + + . . .) q{x, 0) (5.15) 

which formally could be written as 
g(x,A:) =e*(^+®)g(x,0) . 

With the fractional step method, we instead compute 
q*{x,k) = e*‘-^q{x,0) , 
and then 

g**(x,fc) =e*=V-^g(x,0) , 
and so the splitting error is 

q{x, k) - q**{x, k) = q{x, 0) . (5.16) 

This should be calculated using the Taylor series expansions. We have (5.15) 
already, while 

q**{x,k) + kB+^k^B^ + --^ (^I + kA + ^k‘^A^ + --^ q{x,0) 

= (^I + k{A + B) + \k\A^ + 2BA + ^2) + • • q{x,Q) . (5.17) 




(5.19) 
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The splitting error is zero only in the special case when the differential oper- 
ators A and B commute (in which case it turns out that all the higher-order 
terms in the splitting error also vanish). 

If we now design a fractional step method based on this splitting, we will 
see that the splitting error alone will introduce an O(k^) error in each time 
step, which can be expected to accumulate to an 0(k) error after the T/k 
time steps needed to reach some fixed time T (in the best case, assuming the 
method is stable). Hence even if we solve each subproblem exactly within the 
fractional step method, the resulting method will be only first-order accurate. 
If the subproblems are actually solved with numerical methods that are p’th 
order accurate, the solution will still only be first-order accurate no matter 
how large p is. 

The above form of fractional step method is sometimes called the Go- 
dunov splitting, which in general is only first-order accurate. It turns out 
that a slight modification of the splitting idea will yield second order accu- 
racy quite generally (assuming each subproblem is solved with a method of 
at least this accuracy). The idea is to solve the first subproblem qt = Au over 
only a half time step of length k/2. Then we use the result as data for a full 
time step on the second subproblem qt = Bq^ and finally take another half 
time step on qt = Au. We can equally well reverse the roles of A and B here. 
This approach is often called Strang splitting as it was popularized in a 
paper by Strang [231] on solving multi-dimensional problems. 

To analyze the Strang splitting, note that we are now approximating the 
solution operator by Taylor series expansion of this 

product shows that 






(/ + ^kA + + • • ■) (j + fcS + + • • •) 

(/ + ^kA + + •••) (5.20) 

I + k{A + B) + ^*2(^2 + ^ 4. ^ 



Comparing with (5.15), we seee that the 0{k^) term is now captured correctly. 
The 0{k^) term is not correct in general, however, unless AB = BA. 

Note that over several time steps we can simplify the expression obtained 
with the Strang splitting. After n steps we have 



Qn ^ j qO (5 2I) 



repeated n times. Dropping the parentheses and noting that 02 *^ 02 *'^ = e*'^, 
we obtain 



Qn ^ ^\kA^kB^kA^kB^kA . ^^kB^^kAqO 



(5.22) 
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This differs from the first-order splitting only in the fact that we start and 
end with a half time step on A, rather than starting with a full step and 
ending with B. 

The fact that the Strang splitting is so similar to the first-order splitting 
suggests that the first-order splitting is not really so bad, and in fact it is 
not. While formally only first-order accurate, the errors made are analogous to 
evaluating the solution at slightly the wrong time, off by single time step, say. 
While this is enough to make the results formally only first-order accurate, 
relative to the exact solution at this particular time, the quality of the solution 
has not suffered at all. For problems of the sort we are considering here, 
where the solution involves discontinuities such as shock waves, the resolution 
achieved is every bit as good with the Godunov splitting as with Strang 
splitting. In practice the Godunov splitting is often easier to use in terms of 
specifying correct boundary conditions, and also when using variable time- 
step methods as is often done in practice. 

5.4 Stiff Source Terms 

Source terms sometimes model phenomena which occur on much faster time 
scales than we are attempting to resolve with our time step fc, and perhaps 
also act over much smaller spatial scales than our grid can resolve. In this 
case the source terms are said to be stiff, by analogy with the stiff equations 
discussed in Sect. 4.1.6. 

5.4.1 A Combustion Model. Stiff source terms that are not treated care- 
fully can lead to serious numerical difficulties. One such example is combus- 
tion, where chemical reactions (or nuclear reactions in stars) may occur on 
much faster time scales than the gas flow, much faster even than high-speed 
detonation waves. Computations may produce waves that look reasonable at 
first glance and yet are propagating at nonphysical speeds due to purely nu- 
merical artifacts. This was observed in a simple model combustion problem 
studied by Colella, Majda, and Roy tburd [58] . 

Although in practical problems there may be dozens of chemical species 
of importance, in this model there are only two chemical species: “unburnt 
gas” and “burnt gas”. By using g as one of our variables, we can eliminate 
the continuity equation for burnt gas and be left with the Euler equations for 
gu and E together with an additional continuity equation for the unburnt 
gas. We let Z represent the mass fraction of the unburnt gas and assume that 
unburnt gas is converted to burnt gas via a simple decay process of the form 

unburnt gas — > burnt gas , 

where K{T) represents the reaction rate of the burning process. In general 
the reaction rate depends on the temperature T via some Arrhenius relation. 
Typically the reaction rate is very large when T is sufficiently high but neg- 
ligible for small T. For simplicity we can approximate this by an “ignition 
temperature” kinetics model, in which the Arrhenius behavior is idealized to 
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Here To is the ignition temperature and the rate constant for large T is 
expressed in the form 1/r where r is the time scale of the chemical reac- 
tion. After ignition we expect exponential decay of the unburnt fraction like 
exp{—t/r) (ignoring the fluid dynamics). 

When this kinetics model is combined with the Euler equations we obtain 



{g)t + (^)i 

{gu)t + {qu^ +p)x 

Et + {u{E + p))x 

(gZ)t + {quZ)x 



0 , 

0 , 

0 , 

-K{T)eZ . 



(5.24) 



For simplicity we assume that both the unburnt and burnt gases are ideal 
gases with the same ratio of specific heats 7 and temperature T — p/glZ. 
The equation of state is modified by the fact that the unburnt gas contains 
chemical energy that is released as heat in the process of burning. The total 
energy E takes the form 

E = — -h -f- qogZ , (5.25) 

7 — i z 

where qo is the heat release. 

We consider the particular case of a Chapman- Jouguet detonation wave^ 
See, e.g., [53], [60], [97], [257] for more discussion of combustion in gas 
dyanamics, and also Muller’s notes on nuclear burning and supernova ex- 
plosions. For r > 0 we expect some region of finite width across which the 
reactions take place, but we expect this region to be very thin when r is 
small. Independently, Zel’dovich, von Neumann, and Doring made the fol- 
lowing assumption about the structure of a detonation wave with finite rate 
chemistry, which gives the so-called ZND structure. The detonation wave 
consists of an ordinary fluid dynamic shock followed by a deflagration wave. 
The shock raises the temperature of the unburned gas to a value greater than 
To. Through the deflagration wave, the gas burns so that Z decreases from 1 
to 0 while the pressure and density decrease to their final equilibrium values. 
This ZND structure is illustrated in Fig. 5.1 for the particular form of the 
reactions given in (5.24) with the ignition temperature kinetics (5.23). 

The width of the reaction zone is proportional to r. In order to resolve this 
structure numerically it is necessary to use a sufficiently fine grid that several 
grid points are in the reaction zone, and a small enough time step that the 
rapid reaction can be tracked. In practice, however, r is typically very small 
relative to the time scale of primary interest computationally, which is the 
time scale on which the reaction front moves (and is independent of r). For 
very small r we might be content to obtain the C-J limit computationally, 
a jump discontinuity linking the correct equilibrium values and traveling at 
the correct speed, but without the correct ZND structure. 
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Applying the fractional step approach to the equations (5.24), we first 
apply a conservative method to the homogeneous equations that result by 
setting K{T) =0 in (5.24). This gives the Euler equations together with an 
advection equation for Z. To solve the Riemann problem for this system we 
compute pressures pi and Pr from the conserved quantities and the equation 
of state (noting that Zi and Zr may be different). Based on pi^ pr^ ui and 
Ur, we solve the Riemann problem as indicated in Sect. 3.4. We then set 
Z == Z; to the left of the contact discontinuity and Z = Zr to the right of 
the contact discontinuity, since Z is simply advected with the fluid velocity 
in the nonreacting case. This produces the solution to the Riemann problem 
for the system of 4 equations. Using this solution to the Riemann problem, 
we can apply Godunov’s method or a high resolution method to produce U* 
from [/”. 

In the next step we solve the reaction equation in each grid cell. Since 
the source terms are zero for all equations except the Z equation, we obtain 
gTi+i _ ^ _ i^gu)* and — E*. The Z equation becomes 

Z, = -K{T)Z , (5.26) 

since g is held constant in this stage of the splitting. Ordinarily we would 
apply a stable numerical method to the resulting kinetics equations, but 
here we have a sufficiently simple equation that we can in fact use the exact 
solution. We then obtain 

= exp{-K{T*)k) z; , (5.27) 

where Tj*' — Pj / {g*j7Z) is the temperature in the jth cell after the first stage of 
the splitting. Note that we are assuming K{T) remains constant during this 
stage of the splitting in order to obtain the true solution (5.27). Although g, 
gu, and E remain constant, the pressure p and hence the temperature T will 
vary with Z. However, for an ignition kinetics model it is easy to check that 
T remains either above or below Tq throughout this stage, so that K{T) is 
constant. (If T* < To then Z^ =:= 0 so T is constant, while if T* > To then 
Z is decreasing which implies that p and hence T are increasing.) It follows 
that (5.27) does in fact solve (5.26) over the time step k. 

5.4.2 Nonphysical Wave Speeds. Figures 5.1 and 5.2 show the results 
of two computations with this approach on the equations (5.24), using Go- 
dunov’s method for the conservation law stage. In each case we used initial 
data corresponding to the ZND profile linking the states 

gi = 1.4 gr = 0.887564 
ui = 0 Ur ~ —0.577350 

pi = l = 0.191709 

Zi=0 Z^ = 1 , 

with 7 = 1.4, 7Z = 1, qo = 1, and Tq = 0.22. We used 200 grid points with 
h = 0.01 and took k = 0.005, giving a Courant number between 0.5 and 1. 
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Fig. 5.1. True ZND structure (solid line) and numerical solution to the combustion 
problem with r = 0.1 
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Fig. 5.2. True ZND structure (solid line) cind numerical solution to the combustion 
problem with r = 10“® 




In Fig. 5.1 we use r = 0.1 so that the ZND profile is well resolved on the 
grid. The solution is shown at i = 0.4. Although there is some smearing of 
the pressure peak due to the dissipation of Godunov’s method, the wave form 
is essentially correct and is moving at the correct speed s = 1, 

In Fig. 5.2 we use r = 10~®. The ZND profile can no longer be resolved on 
the grid. In the exact solution the reaction wave should still move at speed 
s = 1, but now the numerical solution behaves in a manner that is entirely 
nonphysical. There is a numerical wave traveling at speed 2 in which all of 
the chemical energy is released, followed by an ordinary shock at a slower 
speed. 

Note that the fast wave seen here is moving at a speed that corresponds 
to one grid cell per time step, i.e., at speed 2 = h/k. If we change the mesh 
ratio then this wave speed changes to the new value h/k^ showing that this 
wave is a purely numerical artifact. Similar experiments are reported in [58]. 

To understand what goes wrong here, consider Fig. 5.3, which indicates 
the solution to the Riemann problem arising at an initially sharp detonation 
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wave (so Z = 0 to the left of this interface and Z = 1 to the right). Detonation 
results if the shock wave raises the temperature of the gas to a value above 
the ignition temperature. Physically it is the gas between the shock and 
the contact discontinuity that should then be burned in the second half of 
the fractional step method, since it is only in this region that there is both 
unburned gas and also a sufficiently high temperature. 



n C S 




Fig. 5.3. Solution to the Riemann problem in the combustion or radiation exam- 
pies. The source terms should be active only between the shock wave, marked <S, 
and the contact discontinuity, marked C 



In the standard fractional step method, however, we first update cell aver- 
ages using this Riemann solution (and perhaps high-resolution corrections) , 
and then apply the source terms based on the updated cell averages. The 
problem of incorrect waves speeds arises from the fact that if the new cell 
average yields a temperature greater then the ignition temperature, and if 
k/r ^ 1, then all of the unburnt gas in this cell will be burned in this time 
step, not just the gas that has been heated by the shock. The result is that 
the interface between burnt and unburnt gas moves over by one full grid cell, 
yielding a combustion wave that propagates at the speed of one cell per time 
step. 

This phenomenon was discussed and analyzed in the context of an even 
simpler scalar equation in [148]. A more interesting scalar model is discussed 
in [140]. Fryxell, Muller, and Arnett [94] also demonstrate some of the diffi- 
culties one may encounter when computing detonations numerically. See also 
Muller’s notes in this volume. 

Although the potential difficulty with stiff source terms has been discussed 
in the context of fractional step methods, it should be stressed that similar 
problems can also arise with unsplit methods (which also have numerical 
viscosity) and it is not the fractional step approach that is at fault. This is 
demonstrated in [148] for the scalar model problem. 

On the other hand, in order to properly model problems with troublesome 
stiff source terms it is necessary to better model the coupling between the 
source term and the fluid dynamics. 

One possible solution to this problem would be to implement the frac- 
tional step method slightly differently. Rather than basing the source term 
update on the new cell averages computed after solving the conservation law. 
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one could compute the source term contributions in the process of solving the 
Riemann problem, e.g., by determining how much gas should actually burn 
in the above example. This could then be used to update Z after complet- 
ing the conservation law updates. This approach has recently been explored 
by Christiane Helzel from Magdeburg, who has obtained some encouraging 
results (see Fig. 5.4 for an example.) 
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Fig. 5.4. True ZND structure (solid line) and numerical solution to the combustion 
problem with r = 10“®, using a modified method in which the source term solver 
takes into account the structure of the Riemann solution. 



Other approaches also involve solving some kind of generalized Riemann 
problem where the source terms are taken into account. A wide variety of 
different approaches to handling stiff source terms have been explored in the 
recent literature. A few examples are [24], [120], [184], 

One can also avoid problems by making sure that the grid is sufficiently 
fine, and the time step small enough, that the source terms are fully resolved. 
For r fixed, no matter how small, the fractional step method will converge 
as A:, ft 0. But this is what we typically want to avoid in the “stiff” case, 
if possible. If this is not possible to avoid, then adaptive mesh refinement 
will certainly be crucial to efficiently solve the problem (see Sect. 6.3 below). 

Shock tracking or front tracking (see Sect. 6.3.1) may also be a useful tech- 
nique in this context, see [38], [39], [145]. 

5.4.3 Isothermal Equations and Radiation. Consider a molecular cloud 
that radiates heat as mentioned in Sect. 2.4. If a shock wave passes through 
such a cloud, then the molecules will be heated when passing through the 
shock. But this heat will then be so quickly radiated away relative to the fluid 
dynamic time scales, and the width of the radiation zone so narrow relative to 
the spatial domain of interest, that we can lump the shock wave and radiation 
zone together into a single isothermal shock and use the simpler isothermal 
equations to model the flow. 

If, however, we do use the full Euler equations together with a source term 
for the thermal radiation, then the source terms will be very stiff, as in the 
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combustion example above. However, this does not appear to lead to the same 
numerical difficulties of incorrect propagation speeds, and a simple splitting 
method can be successfully employed for this case. This is presumably not 
true for radiation problems in general (see the notes of Mihalas and Dorfi), 
but it is interesting to investigate this particular example since it sheds more 
light on the nature of the splitting error. 

Again consider Fig. 5.3, but now suppose we are solving the Euler equa- 
tions with radiation. Again it is only the region behind the shock where the 
source terms should be active, now by radiating heat, since only this gas has 
been heated by the shock. In the standard fractional step method we first 
average the energy and other conserved quantities over the entire grid cell 
and then radiate the heat, rather than radiating it from the smaller post- 
shock region. However, in this case we have not made a catastrophic error 
by doing the averaging first. The averaging spreads the heat over a larger 
region, but the total amount of energy to be radiated is the same either way, 
regardless of whether we have a whole cell with a high temperature or a thin 
zone with a very high temperature. In the combustion problem, by contrast, 
this makes a big difference since the gas burns equally well at any sufficiently 
high temperature, and so the larger region of high temperature leads to more 
gas being burned. 

Again the issue of how well the splitting method works comes down to a 
question of whether certain processes commute — in this case we need to in- 
vestigate whether the averaging process commutes with the process modeled 
by the source term. This is discussed by Pember[183] in a slightly differ- 
ent guise. He considers the conservation law with both a source term having 
time scale r and also a viscous term with coefficient e (a viscous regulariza- 
tion of the equation). His conclusion is that the stiff source term may cause a 
problem numerically if the limiting behavior of solutions to this equation as 
e, r — > 0 depends on which parameter vanishes more rapidly. This is relevant 
because physically we are trying to model the case where e — 0 and r > 0 
(but very small), whereas numerically e may be much larger than r due to 
numerical viscosity. With Godunov’s method this numerical viscosity arises 
from the averaging process, and so this is consistent with the analysis above. 

5.5 Quasi- stationary Flow and Gravity 

Another difficulty with source terms arises when the flow is nearly stationary, 
so Qt ^ 0, In one space dimension we then have 

f{q)x ~ V'(9) , (5.28) 

but it may be that the two expressions in (5.28) are both large. The source 
term is roughly balanced by the divergence of the flux. As an example, con- 
sider the isothermal equations in a one-dimensional tube oriented vertically, 
with gravity acting in the negative x-direction. This force appears in the 
momentum equation and we have 
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Qt + ieu)x = 0 , (5.29) 

(^u)t + {qu^ + o?q)x = -9Q , 

where g is the gravitational constant. In equilibrium, the velocity is u = 0 
while there is a gradient in the pressure p = a^g that balances the force of 
gravity, yielding an exponentially varying density, 

g{x) = goexp{-gx/a^) . (5.30) 

Suppose we start with this state initially and apply a splitting method. In 
the first step we solve the homogeneous conservation law, ignoring gravity. 
The density variation will lead to an acceleration of the fluid. In the second 
step we modify the momentum using the source term, and we hope that this 
will restore the initial state in order for equilibrium to be maintained. We are 
relying on two large changes to cancel out, and clearly they won’t exactly. 
The Strang splitting can be used to obtain a second order accurate method, 
but the error constant may be quite large. 

This causes serious difficulties if we are attempting to model the propa- 
gation of small perturbations against the background of a nearly stationary 
flow. This is a common situation in modeling stellar atmosphere dynamics, 
where the self-gravity of a star leads to power-law variations in the density 
and pressure that serves as a background for smaller amplitude dynamics. 

In the context of (5.29), suppose we wish to compute the propagation of 
a small amplitude acoustic wave through the exponentially stratified equilib- 
rium. This signal will be an insignificant component in the solution of the 
Riemann problems in the homogeneous problem, since the density jumps will 
be dominated by the exponential variation. While the second step may nearly 
restore equilibrium, the signal can be totally corrupted in the process. 

Various approaches are discussed in the literature for dealing with quasi- 
stationary problems of the type discussed above. One possibility is to view the 
cell averages as defining a piecewise stationary solution instead of a piecewise 
constant or piecewise linear function as we have done so far. The form of the 
solution in each grid cell is chosen so that (5.28) is roughly satisfied in the cell, 
a piecewise exponential function, for example. As a simpler approximation 
we might use a piecewise linear function with the slope Qx chosen so that 

f{q)qx « V’(9) • 

This piecewise stationary solution is then used in conjunction with a gener- 
alized Riemann solver that resolves the jump at each interface taking into 
account the variation nearby. See [84], [95], [96], [250] for some methods 
based on this approach. 

Here I will discuss a different approach that is related to this but easier 
to implement, and which also allows high-resolution correction terms to be 
computed that axe based on variations in the perturbations from equilibrium. 

Rather than determining a smooth function in each cell whose derivative 
matches the source term, we will reconstruct a piecewise constant function 
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in the z’th cell that has a single discontinuity from to Q'l at the middle 
of the cell. For simplicity we assume 

Q- 5il2 , Qt = Qi + <5i/2 , (5.31) 

for some vector to be determined below. Now suppose we apply Godunov’s 
method. The data is still piecewise constant and the jumps at cell interfaces, 
between and Q'^ ^ give rise to waves propagating into the two neighbor- 
ing cells as usual. The Riemann problem in the middle of a grid cell gives a 
fan of waves which, for small time, remain entirely in the single cell. Hence 
we do not need to compute the detailed solution to this Riemann problem, 
since we easily compute that its contribution to the cell average is simply 
^UiQt) ” f{QT))' R-^ther than using a splitting method, we will also in- 
clude the source term in an unsplit manner. For clarity suppose we are solving 
a linear hyperbolic problem with a source term x/;, so that f{q) = Aq and 
the solution of the Riemann problem is easy to determine. Then the method 
takes the form 

= Qf-^[A^iQ--QU) + A~{Q-^,-Qt)] (5.32) 

-^{AQt-AQT) + kxljiQi). 

We have not yet discussed how to choose 5. The form of (5.32) suggests choos- 
ing it so that the term ^{AQf — AQ~) which arises from the discontinuity 
introduced at the cell center exactly cancels out the source term kip{Qi). 
Then the second line of (5.32) would vanish and the method reduces to what 
looks like Godunov’s method based only on jumps at the interface, with no 
source term. But the jumps have been modified so the Riemann problem 
solved at each interface uses data Qf_i and Q~ rather than the original cell 
averages. High-resolution correction terms can now be based on the waves re- 
sulting from these Riemann problems. As Fig. 5.5 indicates, a large variation 
in Q that is primarily balanced by a source term will lead to small residual 
jumps Q~ — Qf_i at the cell interfaces. 




Fig. 5.5. (a) Introducing a jump of size S at the center of each grid cell gives a 
new piecewise constant reconstruction, (b) Solving the Riemann problem at each 
jump. Three Riemann solutions affect the cell value Qi. The flux difference from 
the Riemann problem at the cell center balances the source term 
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For a general nonlinear flux function, requiring that the second line of 
(5.32) vanish gives the condition 

= (5.33) 

which mimics (5.28). Splitting Qi in this manner thus gives a piecewise con- 
stant approximation to the equilibrium solution within the cell, while pre- 
serving the piecewise constant nature near the cell interfaces where Riemann 
problems must be solved. The resulting jumps at the interfaces correspond 
to perturbations from the equilibrium solution, as desired. 

For a linear system, combining (5.33) and (5.31) and solving for S gives 

S = hA-^xpiQi) . (5.34) 

We use this to define the modified states in (5.31) and then the simplified 
form of (5.32) gives 

= Q 7 - - Qt-i) + ^-(Qr+i - )) • (5.35) 

This is the form used in the implementation since it avoids the source terms 
altogether. It is this form that can now be extended to a high-resolution 
method by using waves based on the modified states. 

It is interesting, however, to rewrite (5.35) in the following form, obtained 
by using the expression (5.34) in (5.31), 

= Q 7 -^iA+iQi-Qi-i) + A-(Qi+^-Qi)) (5.36) 

k{A~^ A A ^t/ji+1/2) j 

where 

^i-i /2 = + ^{Qi)) • 

In this form we can interpret the first-order method as being simply the 
standard Godunov method with a special form of the source term which 
consists of a certain combination of interface values V^t_i/2 and V^i+1/2 instead 
of Note that 

A-^A-^ = RA-^A-^R^^ = RI^R-^ , 

where is the diagonal matrix with Vs on the diagonal in positions corre- 
sponding to positive eigenvalues of A and zeros elsewhere. Hence the decom- 
position 

ipi-1/2 = + A~ A~^xpi_i/2 

splits i^i-i /2 into components that are carried to the right and left respec- 
tively by the characteristics. In (5.36) we use the incoming portion of ^ from 
each cell edge to define the source term used in the cell. 
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Fig. 5.6. Solution to the isothermal equations with gravity. Very weak acoustic 
waves are visible which axe amplified in Fig. 5.7 below 

This approach is frequently used in the literature (e.g., [82], [164]) and 
has the additional advantage that the first-order accurate method (5,36) gives 
a second-order accurate steady state solution [198]. The derivation given here 
shows how this approach can be unified with the idea of a piecewise steady 
reconstruction. In adding the high-resolution terms it seems highly desirable 
to start with the form (5.35) rather than (5.36). 

For a nonlinear problem it may not be so easy to determine 5i so that 
(5,33) is satisfied. Consider the isothermal equations (5.29), for example. Let 
g be the density and m — gu he the momentum in the Fth cell. We must 
take m~ — = m since there is no source term in the conservation of mass 

equation, and then we need to choose ^ ± (5/2 so that 

(m^/^^ -h d^g'^) - {m^ !g~ 4- o?g~) = -hgg . 

In equilibrium, m~0 and this reduces to a linear equation for 6 with solution 
S ~ —hggfd^. In general clearing the denominator gives a cubic equation for 
5, and near equilibrium this has a single real root which can be easily found 
by iteration using the equilibrium value as an initial guess. 

Figure 5.7 shows some sample calculations on the isothermal system with 
solid walls at the two ends of a vertical tube of length 1, using a = = 1 

and the equlibrium state g{x) — e~^. The equilibrium density is perturbed 
by 10“^ over 0.45 < X < 0.55 initially, giving rise to weak acoustic waves 
propagating in both directions. These are barely visible in the plot of the 
density shown in Fig. 5.6. Figure 5.7 shows the true and computed density 
perturbations at time t = 0.15 with the two different approaches. Here the 
background exponential has been subtracted out. The solid line shows the 
“true” acoustic pulses as computed on a much finer grid. 

In Fig. 5.7(a) the Strang splitting has been used. The perturbation is not 
very well captured. Near the perturbation we see overshoots and undershoots 
in spite of the fact that a flux-limiter method is used. The limiter is applied to 
the waves that arise in solving the homogeneous isothermal equations, which 
include the exponential variation as well as the perturbation. 
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Fig. 5.7. Solution to the isothermal equations with gravity. The density perturba- 
tion from the equilibrium solution g{x) = e~^ is plotted. Note these perturbations 
are of the order 10~^ relative to the equilibrium, (a) Using the fractional step 
method, (b) Using a method of the form (5.35) 
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We also see perturbations of a similar magnitude propagating in from 
the boundaries. These arise from the fact that the solid wall boundary con- 
ditions which are correct for the full problem are not the correct boundary 
conditions to impose for the homogeneous conservation law in the fractional 
step method. Developing appropriate boundary conditions for intermediate 
time steps in a fractional step method can be an important component in 
achieving full accuracy, see [139] for one example. In this problem these per- 
turbations from the boundary, though small in relation to the exponentially 
varying equilibrium solution, will overwhelm the waves we are attempting to 
capture at later times. 

In Fig. 5.7(b) the approach described above is used instead, and the waves 
are well captured, at later times as well as at the time shown. This approach 
is developed more fully in [143]. 



6. Multi-dimensional Problems 

The derivation of multi-dimensional conservation laws was discussed in 
Sect. 2.5 and now we will look at the hyperbolic structure of such problems, 
and numerical methods for their solution. For a linear problem in two space 
dimensions, the system of equations takes the form 

Qt + Aqx -h Bqy = 0 . (6.1) 

This problem is said to be hyperbolic if the matrix cos 6 A + sin 6B is diago- 
nalizable with real eigenvalues for every choice of 6. Then taking plane- wave 
initial data which varies only in one direction (say the ^-direction) at angle 
0 to the x-axis leads to a one-dimensional hyperbolic equation of the form 

qt -1- {cosOA -f smOB)q^ = 0 . 
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The nonlinear conservation law Qt+f{Q)x‘^9{Q)y = 0 is hyperbolic at some 
state q if the above condition is satisfied for the Jacobian matrices A = f\q) 
and B — ^'((/). The Euler equations (2.33), for example, are hyperbolic in 2 
or 3 dimensions away from the vacuum state. 

The two-dimensional advection equation takes the form 

qt + uq^ + vqy = 0 , ( 6 . 2 ) 

where (a, v) is the velocity vector. This equation is hyperbolic and the solu- 
tion simply translates along (u,t;). 

In one space dimension we can diagonalize a general linear hyperbolic 
equation using the matrix of eigenvectors, decoupling it into independent 
scalar advection equations for each characteristic variable. For a linear sys- 
tem in more dimensions, this can only be done if the coefficient matrices 
commute, e.g., if AB = BA in (6.1), in which case the matrices have the 
same eigenvectors. Then A and B can be simultaneously diagonalized by the 
eigenvector matrix R: 

^ = RAR~^ , B ^ RMR-^ , 

where A = diag(A^ , . . . , A’^) and M = diag(//^ , . . . , ii^) contain the eigenval- 
ues, which may be different. The system in (6.1) can then be diagonalized to 
obtain 

qt 4- Aqx + Mqy = 0 ^ 

yielding independent advection equations. The p’th equation has the velocity 
vector (A^,/i^). 

If AB ^ BA, then there is no single transformation that will simultane- 
ously diagonalize A and B, The equations are genuinely coupled. This is the 
usual situation physically. Consider linear acoustics, for example. This can 
be modelled by a linear system of 3 equations in two space dimensions: 
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Here p is the pressure perturbation and (w, t?) the velocity, while and po 
are the background density and pressure. The mat rix co sOA + sin^H then 
has eigenvalues 0, ±c for all values of 0, where c = y/jp/g is the sound speed, 
which is the same in all directions. 

The multi-dimensional Euler equations have shock waves that travel 
isotropically in all directions. For some problems we may be able to reduce 
the complexity of the numerical problem substantially by taking advantage 
of symmetry. For example, if we are solving a problem where the solution 
is known to be radially symmetric, then we should be able to rewrite the 
equations in polar or spherical coordinates, obtaining a system that reduces 
to a problem in the single space dimension r. The transformed equations will 
typically involve geometric source terms. 




6. Multi-dimensioii 2 d Problems 103 



Even if the real problems of interest must be studied multidimensionally, 
radially-symmetric solutions are very valuable in testing and validating nu- 
merical codes. A highly accurate solution to the one-dimensional problem 
can be computed on a fine grid and used to test solutions computed with the 
multi-dimensional solver. This is useful not only in checking that the code 
gives essentially the correct answer in at least some special cases, but also 
helps to determine whether the numerical method is isotropic or suffers from 
grid- orientation effects that lead to the results being better resolved in some 
directions than in others. See [134], [144], [147] for some examples of such 
numerical tests. 

6.1 Dimensional Splitting 

The simplest way to extend the finite volume methods developed in one 
dimension to more dimensions is to use dimensional splitting. The frac- 
tional step method described in Sect. 5. is applied to the multi-dimensional 
problem by splitting it into one-dimensional subproblems. For example, the 
two-dimensional linear problem (6.1) would be split into 

x-sweeps : qt -h Aqx = 0 , 

y-sweeps : qt + Bqy — 0 . 

In the x-sweeps we would start with cell averages at time tn and solve 
one-dimensional problems qt -f- Aq^ = 0 along each row with j fixed, updating 
Q^j to Q*j, In the y-sweeps we then use these Qlj values as data for solving 
qt + Bqy ~ 0 along each row with i fixed, which results in 

Note that there will generally be a splitting error unless the operators A = 
Adx and B — Bdy commute, i.e., unless AB = BA, Only in the case where 
the multi-dimensional problem decouples into scaJar advection equations can 
we use dimensional splitting with no splitting error. 

However, the splitting error is often no worse than the errors introduced 
by the numerical methods in each sweep, and dimensional splitting can be a 
very effective approach. It gives a simple and relatively inexpensive way to 
extend one-dimensional high-resolution methods to 2 or 3 dimensions. 

6.2 Multi-dimensional Finite Volume Methods 

Alternatively, finite- volume methods can be developed that are based directly 
on the multi-dimensional integral form of conservation law, e.g.(2.28) in three 
dimensions. The cell averages are then updated due to fluxes through all faces 
of the grid cell, and the fluxes are all computed based on the current cell 
averages rather than first updating by sweeps in one direction and then the 
other. In two space dimensions a conservative flux-differencing formulation 
would have the form 
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QT = • 

The simplest Godunov method of this form consists in solving the one- 
dimensional Riemann problem normal to each cell face in order to compute 
the flux. Extending this method to second-order accuracy requires incorpo- 
rating not only slopes in the normal direction, as in the one-dimensional algo- 
rithm, but also transverse derivative terms that arise from cross-derivatives in 
a Taylor series expansion. With the Strang splitting these terms would arise 
naturally from the fractional step procedure, but with an unsplit method 
they must be explicitly modeled. A wide variety of unsplit methods have 
been developed, e.g,, [22], [56], [85], [200], [205], [219], [220], [243], [266], 

A multi-dimensional version of the wave- propagation algorithms described 
above has recently been developed, yielding a class of algorithms that fol- 
low fairly simply from the one-dimensional ideas and yet appear to be quite 
broadly applicable, fairly robust, and stable up to Courant numbers of 1 rel- 
ative to the maximum wave speed in any direction. These algorithms were 
described in the course, but this material will not be repeated here since 
a complete description (in two space dimensions) has recently appeared in 

[ 144 ]. 

These algorithms are also implemented in the CLAWPACK software [138] , 
in a manner that I hope can be comprehended and will aid in understanding 
the algorithms as well as providing a code that can be applied on many 
hyperbolic systems. This package also contains three-dimensional extensions 
as described in [133], [134]. 



6.3 Grids and Adaptive Refinement 

In many technical fields where computational fluid dynamics is used, a major 
computational difficulty arises from the fact that the physical geometry can 
be very complex in 2 or 3 space dimensions. Examples include aerodynami- 
cal calculations of the flow around an airplane or inside a gas turbine. One 
must first determine a suitable grid on which to perform the calculation, and 
various issues arise in generating suitable grids and then solving the equa- 
tions efficiently and accurately on such grids. Many books on CFD and grid 
generation discuss this issues in detail. 

In astrophysical calculations the geometry is often quite simple by com- 
parison, and many intersting calculations can be done in a rectangular domain 
using Cartesian coordinates, or perhaps with simple spherical coordinates. In 
these notes I will not discuss the grid generation problem at all. 

There are, however, often a wide range of spatial and temporal scales 
present, so that efficient calculation generally requires the use of a nonuniform 
grid even if the geometry is simple. To resolve a shock wave clesmly may 
require a fairly fine grid near the shock, though one should remember that 
a shock from a hyperbolic system is an idealization and typically it will 
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not be resolved exactly as a discontinuity no matter how fine the grid is, 
unless perhaps we employ “shock tracking” as described below. If we add 
viscous terms to the equations, or have reactions or radiation effects that 
are active over very thin regions, then we may need to resolve the physically 
correct width of these transition regions in order to obtain correct solutions, 
as discussed in Sect. 5.4. 

6.3.1 Moving Grids and Front Tracking. One possibility is to adjust 
the computational grid in each time step so that it conforms to the features 
of interest, by moving a fixed set of grid points. One approach is to use a 
Lagrangian formulation of the equations of motion, in which the computa- 
tional grid moves with the fluid. Muller has discussed this to some extent 
in his lectures. This can lead to grid tangling problems when there are large 
defomations of the fluid, and often such approaches are combined with fixed 
Eulerian grids through some form of remapping procedure. 

Another approach is to move the grid points in such a way that they 
are clustered near the regions of rapid variation, without necessarily moving 
them at the fluid velocity. Dorfi’s lectures contain a discussion of one such 
method for 1-dimensional (spherically symmetric) radiation hydrodynamics. 
See also [74], [105]. 

If the feature of interest is a shock wave that should be a sharp discon- 
tinuity, then instead of clustering many points near the shock in order to 
resolve it well with a standard finite- volume method, another approach is to 
keep track of the location of the shock itself and deal with the motion of the 
shock in some special manner. This is called shock tracking or more gener- 
ally front tracking since this approach can also be applied to more general 
discontinuities in q that may not be shocks, e.g., contact discontinuities or 
phase boundaries. 

In one space dimension this requires introducing just one additional grid 
point for the shock location. In solving the Riemann problem at this point 
we can determine the exact speed of the shock and adjust this grid point 
accordingly. In more than one space dimension, front tracking can get quite 
complicated, particularly when the shocks develop interesting structure or 
begin to interact with one another, or when two shocks collide, 

A variety of front-tracking methods of have been developed. See [145] 
[146] for one approach related to the wave propagation algorithms described 
here. The references in these papers point to many more examples. 

By contrast, methods of the type developed in these lectures are generally 
called shock capturing methods, in that they attempt to capture shocks 
and other discontinuities nicely on a fixed grid, rather than adjusting the grid 
to track the shock. Shock-capturing methods are much simpler to implement 
than shock-tracking methods, particularly in more than one space dimension. 
However, in order to capture features such as shock waves very sharply it is 
often necessary to have a rather fine grid. 




106 R.J. LeVeque 



6.3.2 Adaptive Mesh Refinement. Rather than allowing grid points to 
move, another approach to clustering grid points where they are needed is 
to refine a uniform grid in regions where more resolution is needed. This 
approach can be used both for capturing shocks well and for introducing 
fine grids in regions of rapid smooth transition. Such an approach should be 
automatically adaptive in that it chooses the regions of refinement based on 
the behavior of the solution, and for time-dependent problems the regions of 
refinement must change with time. 

I will discuss adaptive mesh refinement (AMR) in the context of Cartesian 
grids, though the same approaches can be applied on logically-rectangular 
curvilinear grids. 

There are two main strategies that have been used to handle the data 
structures involved when refining a rectangular grid. One approach is to re- 
fine individual grid cells as needed, typically by splitting a single cell in two 
dimensions into four pieces. Each of these pieces may be further subdivided 
recursively, depending on how many levels of refinement are allow’ed. A conve- 
nient way to keep track of these cells in two dimensions is to use a quad-tree 
structure, a tree in which each node corresponds to a grid cell and those 
which are subdivided have four leaves corresponding to the subcells. In three 
dimensions cells would be subdivided into 8 pieces, leading to an oct-tree 
data structure. 

The paper [99] shows an example of such a grid in 3D, used to solve the 
MHD equations for the interaction of the solar wind with a comet. This 
calculation illustrates another reason why adaptive mesh refinement can be 
essential when dealing with a variety of spatial scales. The comet in this cal- 
culation is 10km across, while the full computational domain is 50 x 50 x 75 
millions of kilometers. 16 levels of refinement have been used so that the 
smallest grid cells, near the comet, are roughly 48km on a side. Even at this 
resolution, the comet itself is a point source within one grid cell, but this is 
adequate to give the correct global solution. It would be absolutely impos- 
sible, however, to use this resolution over the entire computational domain. 
See [68] for another application of this adaptive MHD code, in calculating 
the magnetic field around Venus. 

Another approach is to refine rectangular patches of the computational 
domain rather than individual cells, again recursively applying this idea to al- 
low increased resolution in some regions. This approach, illustrated in Fig, 6.1, 
was pioneered by Berger and 01iger[27] and has been developed further by 
Berger and coworkers [19] [25] [28] [26]. 

Refinement on rectangular patches has the advantage that the data struc- 
tures remain relatively simple, and consist of a nested set of grids. On each 
grid a standard finite- volume method is used to sweep over the grid, though a 
certain amount of additional work must then be done at the interface between 
grids to maintain accuracy and global conservation. Refinement of individ- 
ual cells requires more work per cell in advancing the solution, but has the 
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Fig. 6.1. Adaptive mesh refinement on rectangular patches for a spherical Riemann 
problem. Initially the pressure is 10 inside a sphere of radius 0.5 and 1 outside, with 
constant density p = 1 and zero velocity, (a) shows the adaptively refined grid. 
Refinement by a factor of 2 is used in Level 2 and by an additional factor 4 in Level 
3. (b) shows contours of density 
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advantage that fewer grid cells need to be refined since the refinement can 
really be focused where it is most needed. When refining on patches, the cells 
which are flagged as needing refinement must be clustered into rectangular 
patches. See [19] for more details on the refinement and clustering algorithms. 
Powell [186] gives some additional comparison of these approaches. 

When refining in space and then solving the hyperbolic equation, we also 
generally want to refine in time by the same factor. If we refine by a factor of 
4, for example, then we should take 4 time steps on the refined patch for each 
time step on the coarse grid. This insures that we are using a comparable 
Courant number on each grid and avoids the need for very small time steps 
on the coarsest grids. 

At an interface between coarse and fine grids, we must also insure that 
the formulas used to update the solution on each grid are consistent with 
one another. In particular, when a conservation law is being solved we must 
preserve global conservation. See [28], [30] for details on how this is done. 

Berger^s work has recently been combined with the CLAWPACK code to 
create AMRCLAw[29], [30], which can be used on quite general hyperbolic 
systems. Figure 6,1 shows a sample calculation using amrclaw. A spherical 
Riemann problem is solved in which the gas is initially stationary with uni- 
form density but with higher pressure (and temperature) inside a sphere of 
radius 0.5, The data agrees with that used for the one-dimensional Riemann 
problem shown in Fig. 2.2 and for short times roughly the same structure 
is seen in the radial direction. The outward-moving shock and contact dis- 
continuity are separated by a thin shell of high-density gas that can only be 
resolved on a fairly fine grid. The computation shown W2ts done in two space 
dimensions using the AMRCLAW package, including source terms for cylindri- 
cal symmetry about the y-axis (handled by the fractional step approach). 

Figure 6.1(a) shows the grid structure at time t = 0.043. In this computa- 
tion two levels of refined grids have been introduced, refining first by a factor 
of 2 and then by a factor of 4 in going to the finest level. Regridding every 
few time steps based on Richardson extrapolation error estimates allows the 
region of refinement to follow the thin structure. 

Figure 6.1(b) shows contours of the density at this time, along with out- 
lines of the individual rectangular grid patches. We can see that both the 
shock and contact discontinuity are well resolved. The solution also appears 
to be radially symmetric, with no ‘^grid orientation” effects that sometimes 
plague less accurate methods (Sect. 7.6). This is also clearly seen in Fig. 6.1(c), 
which shows a scatter plot of the density Qij in every grid cell plotted against 
Tij , the distance from the center of the cell to the origin. The density profile is 
similar to what is seen in Fig. 2.2 for the one- dimensional Riemann problem 
with the same data. 

Berger’s code has previously been adapted to astrophysical problems by 
Rolf Walder, who has studied colliding flows in binary-star systems [177], 
[253]. Figure 6.2 shows one such calculation in two space dimensions with 
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Fig. 6.2. Adaptive mesh refinement calculation of colliding winds between binary 
stars. Five levels of mesh refinement axe used and th^e solutions on the finest four 
levels axe shown. Note that Level 5 is used only in the interaction region directly 
between the stars. Calculation by R. Walder 
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Fig. 6.3. Several particle paths are shown to indicate mass accreting from a red 
giant star to a companion white dwarf. Adaptive refinement was used to concentrate 
the grid in the region of interest in this three-dimensional calculation by R. Walder 



five levels of refinement. Figure 2.5 shows results from a three-dimensional 
calculation. 

Figure 6.3 shows the results of another three-dimensional calculation by 
Walder where AMR was a crucial ingredient. A Red Giant / White Dwarf pair 
is modeled, in which the wind velocity for the red giant is comparable to the 
angular velocity of the rotating system. This results in turbulent flow near the 
white dwarf, where mass is accreting. Figure 6.3 shows several particle paths 
from the three-dimensional simulation. AMR is used to resolve the flow in the 
neighborhood of the white dwarf without the need for an excessively fine grid 
everywhere. Five levels of refinement were used, with refinement by factors of 
2 or 4. A total of 861,184 cells were used. A uniform grid calculation with the 
same resolution as the finest grid would have required 2560^ = 1.68 x 10^^ 
cells and 64 times as many time steps in some regions. A uniform calculation 
might be more accurate, but this calculation is sufficient to give a rough 
estimate of what might happen in such an accretion process. For a deeper 
insight, one would also need to add viscosity and a turbulence model. 
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Adaptive mesh refinement is a powerful technique for solving astrophys- 
ical problems efficiently, especially for problems in three space dimensions. 
Related AMR methods have been used elsewhere in computational astro- 
physics, e.g., [48], [75], [163], [228]. 

7. Computational Difficulties 

The high-resolution finite methods discussed in these lectures can be used 
very successfully for a wide range of problems. They are not foolproof, how- 
ever, and one should never accept computed results without a critical study 
of their accuracy. This is often hard to assess for complex problems where 
the exact solution is not known, but the following techniques can help: 

- Investigate simple cases where exact solutions might be known, or at least 
the correct qualitative behavior of the solution is well understood. 

“ Reduce the number of space dimensions by considering radially symmetric 
solutions, for example. Then a fine-grid solution in one space dimension 
can be used as a reference solution for the multi-dimensional solution. 

- Perform grid refinement studies on the real problem of interest. If you 
refine the grid does the solution remain basically the same? If not, then 
you probably cannot trust either solution. (If so, both solutions may still 
be completely incorrect. An error in the code that changes the equations 
might lead to a method that converges very nicely to a solution of the 
wrong equation.) 

A number of specific difficulties that can arise in solving the Euler equa- 
tions have already been mentioned, such as 

- The use of a nonconservative method can give shocks that look reasonable 
but which travel at the wrong speed (Sect. 4.3). 

- Stiff source terms can lead to similar results (Sect. 5.4). 

- The computed solution may not satisfy the entropy condition, leading to 
discontinuities where there should be a smooth rarefaction (Sect. 4.6.4). 

A number of other problems are frequently encountered in solving the 
Euler equations using high-resolution methods, and in the following sections 
a few of these are briefly discussed. (See Quirk [190] for some additional dif- 
ficulties.) Along with alerting the newcomer to a few potential pitfalls, an 
exploration of these difficulties should offer further proof that a good under- 
standing of the mathematics and physics of these equations is essential in 
diagnosing and improving computational algorithms. 

7.1 Low-Density Flows 

In conservative formulations of the Euler equations, the total energy is one of 
the conserved quantities. The internal energy and pressure are computed from 
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this after subtracting out the kinetic energy determined from the momentum 
and density. In regions where the internal energy is very small compared 
with the kinetic energy, e.g., in high-speed, low-density flow, inaccuracies in 
the conserved quantities can easily lead to the kinetic energy exceeding the 
total energy, which results in the computed pressure being negative. This 
generally causes the code to crash, for example in the square root required 
to compute the sound speed. This problem is often dealt with by simply 
resetting negative pressures to some slightly positive value in the code, but 
it is preferable to use more robust methods that do not suffer this deficiency. 

Linearized Riemann solvers, such as the Roe solver discussed in Sect. 4.6.4, 
may also fail near low densities or pressures by returning intermediate states 
that are nonphysical. Einfeldt, Munz, Roe and Sjogreen [79] give an analysis 
of some of these problems. They call a method “positively conservative” if 
the internal energy always remains positive for any physical data, and show 
that a variant of the HLLE method (Sect. 4.6.3) is positively conservative. 

7.2 Discrete Shocks and Viscous Profiles 

A number of difficulties axe associated with the fact that the numerical ap- 
proximation to a shock wave is not a sharp discontinuity (unless shock track- 
ing is used) but rather is smeared over one or more grid cells. A single isolated 
shock in the p’th family consists, in principle, of only two values qi and Qr 
separated by a discontinuity. In phase space, Qr lies on the p’th branch of the 
Hugoniot locus of qi, and vice versa. The solution to the Riemann problem 
between these states consists of this one shock wave, with zero strength waves 
in the other families. A shock-capturing method, however, will introduce new 
values of q in the smeared region, and solving the Riemann problems involv- 
ing these new states will, for a nonlinear system, introduce spurious waves 
in the other families. In spite of this, and somewhat remarkably in view of 
this interpretation, high-resolution shock-capturing methods often produce 
discrete shocks that propagate as quite sharp representations of the correct 
shock, generating very little visible noise in the other families. This is largely 
because the dissipation in the numerical methods mimics the physical dissi- 
pation that is natural for such systems, e.g., viscosity and heat conduction. A 
shock wave is a mathematical idealization of a thin transition, which results 
in a smooth curve of states between qi and qr in phase space (the viscous 
profile) . As long as the numerical viscosity is of sufficient magnitude and of 
a reasonable form, the discrete shock may behave similarly to the physical 
shock, without generating noise in the other families. 

In certain circumstances, however, this noise can be very noticeable and 
gives rise to various numerical phenomena that are frequently observed with 
the Euler equations and other systems. In particular, a wave that is moving 
at a slow speed s, so that sk/h is much less than 1 for this wave, experi- 
ences little numerical dissipation with many high-resolution methods. The 
next three sections discuss some situations where this lack of dissipation can 
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lead to spurious noise since the numerical method is no longer modeling the 
dissipative aspect of the physics properly. All of the phenomena mentioned in 
the next few sections can be minimized by using a more dissipative method, 
though perhaps at the expense of some sharpness in the resolution of shocks 
and particularly contact discontinuities. Extra dissipation must be added ju- 
diciously where needed. For example, Marquina’s flux- vector splitting method 
described in Sect. 4.8.3 puts in extra dissipation when the eigenvalue changes 
sign from one cell to the next, which signals that the shock speed may be 
close to zero. Donat and Marquina [73] show some comparisons for the type 
of problems discussed below. 

For conservation laws other than the Euler equations, there may be ad- 
ditional trouble if the form of numerical dissipation introduced by a given 
method is not modeling a reasonable physical dissipation. For some nonlin- 
ear systems, different forms of dissipation can lead to different weak solutions 
in the limit of vanishing dissipation, in which case numerical methods must 
be used with particular caution. 



7.3 Start-Up Errors 



Numerical noise of the type described in the previous section is often observed 
in studying the propagation of an isolated shock numerically. If the data is 
chosen as a sharp discontinuity, then relatively large-magnitude waves in the 
other families are initially generated, before the discrete shock settles down 
into a viscous traveling wave. 

Consider one step of Godunov’s method, for example, on data 



{ qi if i < 0 

\ Qr if z > 1 . 



Solving the Riemann problem at interface i = I gives the single shock wave 
propagating at some speed s > 0, say. Then 



{ qi if z < 0 

Qm if i = 1 

Qr if i > 1 , 



where 



sic 

qm = qr- -^{qr 



- qi) = (1 - v)qr + uqi , 



with u = sk/h. The CFL condition requires < 1 and in fact < 1 in general 
since the corresponding wave speed A will be larger than s near state qi or qr . 
Hence qm is a convex combination of qi and qr which lies on a straight line 
connecting these two states in phase space. In the next time step, we must 
solve Riemann problems between qi and qm aJid between qm smd qr- 

For a linear system, the Hugoniot loci are straight lines, so no waves in 
other families will be generated. The shock will become smeared, but will not 
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Fig. 7.1. Start-up error resulting from a Riemann problem with an isolated shock 
initially at x = 0.5. (a) Density (b) Pressure 



generate noise in the other families. For a nonlinear problem, on the other 
hand, the Hugoniot loci are not straight lines and so qm lies off of the locus 
connecting qi and qr- Solving these new Riemann problems then results in 
waves in the other characteristic families along with new approximations to 
parts of the original shock. 

As a result, starting a shock- wave computation for the Euler equations 
with piecewise constant data of this sort will typically result in a spurious 
acoustic wave in the other nonlinear family and a spurious entropy wave. The 
magnitude of these waves depends on the strength of the initial shock, since 
they result from the nonlinearity of the Hugoniot locus. The spurious acoustic 
wave typically dissipates quickly and may not be noticed at all. The entropy 
wave, however, is often observed, particularly if the fluid velocity behind the 
shock is small. The entropy wave is simply a variation in density moving at 
this velocity, which gives rise to a contact discontinuity in the solution to 
any Riemann problem. If the velocity is small then this wave has little effect 
on the neighboring cells and experiences little numerical dissipation. Any 
variations in density remain essentially stationary, an artifact of the initial 
conditions which is often called start-up error. 

Figure 7.1 shows an example when the high-resolution method from 
Sect. 4.7 with the superbee limiter is used on the data 

Qi = 3.86, ui = 0.0, Pi = 10.33, for a: < 0.5 , . 

rr Ij Ur = ~2.63, Pr — 1, for X > 0.5 , 

which gives only a 3-wave moving at velocity s — 0.92. Since the velocity 
behind the shock is zero, the density variation remains stationary at x == 0.5. 
There is also a weak acoustic 1-wave visible in the figure at about x = 0.1. 
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7.4 Wall Heating 

This same sort of artifact is also often seen when two shocks collide or when a 
shock reflects off a solid walk Consider two identical shocks approaching each 
other with zero velocity in between (which also models reflection at a wall 
halfway between the shocks — see Sect. 4,9.3). Each shock may have settled 
down to some numerical traveling wave that does not appear to generate any 
noise. But when the shocks collide, the result is two new out-going shocks 
with a different state in between than before the collision, with higher density 
and pressure but still zero velocity. During the interaction phase considerable 
noise will be generated in the other families, and in particular a spurious 
entropy wave will be generated which is then stationary in the zero- velocity 
region between the out-going shocks. This wave yields a dip in the density. 
The pressure, however, is nearly constant and so this dip in density results 
in an increase in the temperature T = pI'R'Q- The gas appears to have been 
heated at the point where the collision occurs. 

In particular, in computing the reflection of a shock off a solid wall, this 
spurious temperature rise occurs at the wall itself. This phenomenon is fre- 
quently observed in numerical simulations where shocks reflect off physical 
boundaries, and is known as wall heating in the literature. See, for example, 
[73], [175]. 

7.5 Slow-Moving Shocks 

Figure 7.2 shows the numerical solution to the same shock propagation prob- 
lem as in Fig. 7.1, but with the velocity shifted on each side so that the shock 
speed in this reference frame is much closer to zero. The data is the same 
as (7.1) but with 0.81 subtracted from each velocity. Otherwise, exactly the 
same method has been used as in Fig. 7.1, again with Courant number near 
0,9. The solution is shown at time t = 0.5, after 260 time steps. In this case 
the propagating shock does not settle down into a smooth traveling wave, 
but instead continuously generates strong acoustic and entropy waves, giving 
rise to some oscillations in the pressure as well as the density. Discussion 
and analysis of this problem can be found in many papers. Arora and Roe [7] 
show nice plots of this behavior in phase space, showing that the numerical 
waves can deviate substantially from the correct Hugoniot locus. Kami and 
Canic [125] develop a modifled equation analysis that illustrates the lack of 
numerical dissipation in this situation. See also [121], [193]. 

Note that the high-resolution method used here, with the superbee lim- 
iter, is a TVD method (see Sect. 4.7.5) on a scalar conservation law. Scalar 
shocks must be captured in a nonoscillatory fashion. For systems of equations 
there is no such guarantee in general. Developing high-resolution methods by 
extending the scalar methods as described in these notes has proved remark- 
ably effective, but is not entirely foolproof as these examples show. 
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Fig. 7.2. Oscillations generated in a slowly- moving shock. The shock was initially 
at X = 0.5 and has moved to x = 0.555 at time t = 0.5. (a) Density (b) Pressure 



7.6 Grid Orientation Effects 

When a shock-capturing method is applied in more than one space dimension, 
shocks are generally not exactly aligned with the computational grid. Instead 
they cut through the grid at some angle. The methods which have been dis- 
cussed here, based on solving one-dimensional Riemann problems normal to 
the cell interfaces, are using a series of Riemann problems in the coordinate 
directions to approximate a single shock at some angle to the grid. It is some- 
what surprising that this works at all, but in fact it is remarkably successful 
in general. There are, however, some situations in which the orientation of 
the computational grid can be directly observed in computational results. 
See [147] for some examples. However, using a higher-order method which 
includes appropriate cross-derivative terms from the Taylor series expansion 
typically eliminates much of this grid-orientation effect. This is generally not 
a serious problem except perhaps with very strong shocks or when a curved 
structure is very poorly resolved on the grid. 

7.7 Grid- Aligned Shocks 

In a multi-dimensional calculation it may seem advantageous to have a shock 
aligned with the grid so that Riemann problems normal to the cell edges 
are also in the physically correct direction. However, a shock that is nearly 
aligned with the grid can also suffer certain numerical instabilities that have 
no analog in one-dimensional calculations. Figure 7.3 shows density contours 
at a sequence of times for a colliding flow problem. Initially g = p ~ 1, 
= 0 everywhere, while the x- velocity is u = +20 on the left half of the 
domain and u = —20 on the right. This colliding flow should give rise to two 
symmetric shock waves propagating outwards. If the initial data is exactly 
uniform then the calculation will yield a reasonable approximation to this. 
For the calculation in Fig. 7.3, however, the density was initially perturbed 
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Fig. 7.3. Colliding winds give rise to two outgoing shock waves. An initial density 
perturbation of magnitude 10“® grows due to numerical instability 

to ^ = 1 -f 10“® in a single grid cell in the center. At time t = 0.08 the 
effect of this perturbation is just barely visible in the contour plot, though 
examining the deviation from an unperturbed calculation shows that it has 
grown to a magnitude of 0.3 and now covers a larger set of grid cells. By time 
^ = 0.12 its effect is clearly visible in the contour plot and there are order 1 
perturbations in the density that spread out from the center and completely 
destroy the shock structure. 

In this calculation, 7 = 1.05 and the simplest Godunov method has been 
used with the Roe solver and Courant number 0.45. The shocks are moving 
quite slowly; roughly 1000 time steps have been taken in Fig. 7.3. This type of 
numerical instability is seen only for strong shocks that are aligned with the 
grid, or nearly so. The instability develops as an oscillation on the grid scale 
in the ^/-direction, as seen in Fig. 7.3. Quirk [190] refers to this as odd-even 
decoupling and gives some analysis of this. It is in solving the Riemann 
problems in the 7/-direction, where nothing should be happening, that small 
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amplitude oscillations in density and pressure on the grid scale are magnified 
unstably. In the calculation of Fig. 7.3, these were excited by a perturbation at 
a single point (on one side of the initial discontinuity, hence the asymmetry in 
x). In an actual calculation with a strong shock nearly aligned with the grid, 
perturbations due to other flow features or even to rounding errors can lead 
to this instability developing simultaneously along the length of the shock, 
leading to a more clearly visible oscillatory structure at later times. 

The choice of Riemann solver affects this stability. Since the density per- 
turbations should be propagating at velocity v ^ 0 in the y-direction, meth- 
ods which use the Roe solver or even an exact solver will add very little 
numerical dissipation, as mentioned in Sect. 7.5, allowing the instability to 
grow. According to Quirk, the problem is eliminated using the HLLE solver, 
for example, and he discusses a hybrid approach where one switches to such 
a method near strong shocks. The Marquina solver (Sect. 4.8.3) also appears 
to be more stable in this situation. 

A shock propagating at an angle to the grid does not suffer from this same 
instability. Shock fronts are often curved, however, and will be tangent to the 
grid lines at some point. Where the shock is locally aligned with the grid, an 
instability of this same form can arise which remains confined to this region. 
This resembles the early stages of the instability shown in Fig. 7.3, giving 
a small protrusion ahead of the shock that is sometimes called a carbun- 
cle [190]. This has been observed in some calculations of radially expanding 
shocks, e.g., in supernova explosions, where instabilities can develop along 
the coordinate axes in a Cartesian calculation. This type of instability can 
also plague colliding wind calculations in binaries [253]. It is also seen in 
supersonic flow around blunt bodies, which produces a bow shock. If the bow 
shock is aligned with the grid along the stagnation line, where there is no 
transverse flow, then a carbuncle can form with some methods [190], [258]. 

Switching to a more diffusive method can alleviate this problem, but 
care must be taken to insure that physically correct instabilities are not also 
damped. For example, the high-density region which develops between the 
shock waves in the colliding flow of Fig. 7.3 is subject to the thin shell 
instability, which is physically relevant in many astrophysical problems, 
particularly for radiating shocks. See, e.g., [32], [70], [230], [251]. 



8. Magnetohydrodynamics 

In astrophysical problems one must often study the flow of ionized gases 
subject to magnetic fields. The field exerts a force on the gas while the motion 
of the gas perturbs the magnetic field. To model such flows it is necessary to 
use the equations of magnetohydrodynamics (MHD) to couple equations for 
the magnetic field to the hydrodynamic (actually gas dynamic) equations. 
This uses a simplified version of MaxwelFs equations to model the coupling. 
A simplified set is used because we only seek to model the motion of the 
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particles, not the propagation of electromagnetic waves on much faster time 
scales. 

In this section these equations will be briefly reviewed, in the context of 
hyperbolic theory, i.e., with emphasis on the characteristic structure, wave 
speeds, solution to the Riemann problem, etc. I will concentrate on pointing 
out how this system differs from the simpler Euler equations and what new 
difficulties arise compuationally. The presentation of the equations given here 
owes much to other sources, particularly [124], [199], The MHD equations 
are discussed in many texts, e.g., [118], [129], [218], 

8.1 The MHD Equations 



In MHD we assume that the flow is nonrelativistic and that we are interested 
in variations on a slow time scale. Then in Maxwell’s equations we can ignore 
the displacement current and approximate 
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jja -ux B , 


(8.2) 


and hence 
dB 
dt 


= -V X E 


(8.3) 




w V X (w X B) + rjV^B . 


(8.4) 



Explicit dependence on the electric field has been eliminated and we have an 
evolution equation for the magnetic field B that depends only on B and u, 
the velocity. Here r) is the resistivity and for ideal MHD we assume rj = 0 so 
that this dissipative term vanishes. 

These equations are also coupled with the condition 

V • H = 0 (8.5) 

from Maxwell’s equations: the divergence of the magnetic field must always 
be zero. This is a physical law that is assumed to always hold (barring the ex- 
istence of “magnetic monopoles”, which have never been observed). Solutions 
to the MHD equations will automatically maintain (8.5) for t > 0 provided 
the initial data satisfies (8.5) at t = 0, (The numerical solution, however, 
may not. This constraint must then generally be imposed separately on the 
numerical solution, as discussed in Sect. 8.6.) 

Setting T] = Q, we can rewrite (8.3) as 

dB 

-h V • {uB - Bu) — 0 , (8,6) 

This is coupled with the Euler equations for the evolution of u, with addi- 
tional coupling coming from the fact that the magnetic field exerts a force 
on the fluid, which appears in the momentum equation. This force is 
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j X B — -^{V X B) X B 

47T 

= ± - BV ■ b) , (8.7) 

where = B • B. In view of (8.5), the last term in (8.7) drops out and 
this is simply the divergence of the Maxwell stress tensor. Using this, the 
momentum equation can be written in conservation form as 



(^M)e + V • + l(p+ 



BB = 0 . 



( 8 . 8 ) 



The isotropic force enters in a form similar to the pressure and is called 
the magnetic pressure, while the tensor BB is the magnetic tension. 

The magnetic field also affects the energy equation. The total energy is 
now defined as 

E-e-\- ^q{u u) + ^B^ , 
where the internal energy e is still given by 



for a gamma-law gas. The energy fiux is given by 



E + P+ ) u - i~B{u ■ B) . 
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47T 



The new terms relative to the Euler equations result from the work done on 
the gas by the magnetic field. 

At this point it is convenient to rescale B to incorporate a factor of 
l/v7^ = l/\/47r into the definition of B, This eliminates all of these factors 
in the above equations and the ideal MHD equations are thus 
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This is in the conservation form (2.27). Writing out all of the components, 
we obtain 
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and similax expressions for the fluxes g and h in the y- and ^-directions. 
Here Bi, Byy and Bz are the three components of B, and subscripts do not 
indicate derivatives. 

8.2 One-Dimensional MHD 

Consider a plane-wave solution in which q{x, y, z, t) varies only with x and t. 
Then all y- and ^-derivatives are zero and we are left with the equations of 
one-dimensional MHD, 

qt + f{q)x = 0, (8.10) 

with q and f{q) as above. One thing to note immediately is that the fifth 
component of f{q) is zero, and hence the fifth equation of (8.10) is simply 




Any spatial variation in Bx remains stationary in time. This should seem 
nonphysical; one would expect all wave speeds to be relative to the speed u 
of the gas, by Galilean invariance. However, for purely one-dimensional MHD 
the condition (8.5) requires that -§^Bx = 0 since ^By = ^Bz = 0. Hence 
Bx must be spatially constant as well as temporally constant and so the wave 
speed is irrelevant. We can in fact drop the equation for Bx altogether, and 
view the constant value of 5^ as a parameter in the remaining system of 
seven equations for 

q = {q, eu, QV, QW, By, E) . 

Linearizing these equations allows us to determine the characteristic wave 
speeds. Computing the eigenvalues of the Jacobian matrix about some con- 
stant state yields 

Xi = u — Cf y X2 = u — ca , Xz = u — Cs y A4 = u , (8.12) 

A5 = tX -h Cg , Xq = U Ca y X7 = U Cf . 

Waves propagate at speeds c/, ca, Cg relative to the gas. These speeds are 
defined and desribed below. The middle wave with X4 = u travels at the 
fluid velocity and corresponds to the contact discontinuity as in the Euler 
equations. This wave can carry an arbitrary variation in density while all 
other variables are constant, i.e., the corresponding eigenvector is 

r'* = (1,0, 0,0, 0,0,0). 

Compare this with the three-dimensional Euler equations in the case of a 
plane wave, which gives a system of 5 equations with wave speeds 

X\ = u — a y X2 = X^ = X4 = u y A5 = IX “h d , 

where a is the sound speed. In this case there are three waves with speed 
u since the contact discontinuity can also carry arbitrary variation in the 




122 R.J. LeVeque 



transverse velocities v and w as well as in q. This shear velocity, assuming 
it varies only with x and not with y or 0 , is simply advected with the flow 
and has no eflFect on the fluid dynamics. (This is only true for an ideal fluid 
where we ignore viscosity and hence there is no resistence to shear motion.) 

In the MHD equations this is no longer true. In ideal MHD the field lines 
are “frozen into” the flow since the equation (8.3) implies that B is trans- 
ported with the fluid. Hence a shear motion will cause a distortion of the 
field lines and will result in a force on the fluid. In this sense MHD is more 
similar to the equations of elasticity from solid mechanics than to fluid me- 
chanics. An elastic solid supports two different types of waves: compressional 
waves (pressure waves, or P- waves) that are acoustic waves as in a fluid, 
and transverse waves (shear waves, or S- waves) that arise from the fact that 
a transverse displacement results in a restoring force from bonds between 
molecules that are absent in a fluid. These two types of waves propagate at 
distinct speeds, with the P-waves (primary waves) propagating faster than 
the S- waves (secondary waves). 

In MHD, the magnetic field resists transverse motion and hence there 
exist purely transverse waves in which u, and p are constant but the 
transverse velocities and magnetic fields vary. These are known as Alfven 
waves, which propagate at the Alfven velocity 

CA - V^xIf^OQ 

relative to the fluid velocity u. These are the 2-wave and 6-wave in the order- 
ing (8.12). Like the contact discontinuity, these waves are linearly degenerate 
since and hence the wave speed, is constant throughout the wave. 

There are two sets of waves remaining to discuss, propagating with speeds 
Cf and Cs relative to the gas. These are the magnet osonic waves which 
are nonlinear waves in the general Riemann problem that will typically have 
shock or rarefaction wave solutions. The two sets are called the slow and fast 
magnetosonic waves since Cs < ca < Cf. These waves involve compression of 
the gas. In certain cases one set of waves or the other reduces to simple 
acoustic waves, but in general there is a more complicated coupling of each 
with the magnetic field. 

Following [199], set 

— Bx^y^zl y/Q 5 ^ "b + ^2: 

and again let a — yyplQ be the sound speed. Then computing the eigenvalues 
of the Jacobian matrix, the fast and slow wave speeds can be found as the 
larger and smaller values of satisfying the equation 

- (a^ -h b^)c^ + - 0 . 

This yields 

^ ± V(a2+ 62)2 _ 4^252^ (8.13) 
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f A s u s A f 




Fig. 8.1. Wave structure for the MHD equations. The symbols f, A, and s denote 
fast, Alfven, slow waves, and u the contact discontinuity 



with /, s corresponding to H-, - respectively. Since in general < c/, 

the Alfven wave is also often called the intermediate wave. The wave 
structure is illustrated in Fig. 8.1. 

The magnetic field B in the state we are linearizing about lies at some 
angle to the x-axis. If this angle is 0 or 90 degrees (i.e., b± or is 0, respec- 
tively), then certain degeneracies arise as discussed below. In general 6x^0 
and these two directions define a unique third direction, orthogonal to both. 
It is in this direction that the magnetic field and velocity variations in an 
Alfven wave occur. Ji b_\_ = 0 then such transverse motions have 2 degrees 
of freedom in the entire y-z plane and one would expect 2 linearly indepen- 
dent waves traveling at the Alfven speed. In fact, in this case either Cs = ca 
or c/ = Cyi, as discussed further below, and one of the magnetosonic waves 
degenerates to a second Alfven wave. 

Except in this degenerate case, the fast and slow waves each have some 
compressional component. In ordinary gas dynamics a compressionaJ wave 
travels at the sound speed determined by ~ dp/dg at constant entropy. 
Fast and slow waves propagate similary, but the magnetic pressure must now 
be included in this expression and the magnetosonic speeds can be described 
by 

c^ = -^{p + B^). (8.14) 

The fluid pressure p increases with density, dp/dg > 0, but the magnetic 
pressure may either increase or decrease, depending on the orientation of 
B and the changes in velocity across the wave. This means that the mag- 
netosonic speed c given by (8.14) will either be greater than a or less than 
a, depending on whether changes in the fluid and magnetic pressure comple- 
ment one another or partially cancel out. These two cases yield fast and slow 
waves, respectively. 
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The degenerate cases bx — 0 and 6j_ = 0 are worth discussing in detail 
for two reasons. First, they give some indication of how these waves relate 
to gaS'dynamic waves in the Euler equations. Second, in these special cases 
several waves coalesce and the system is not strictly hyperbolic, which can 
create numerical difficulties as discussed in Sect. 8.4. Genuine nonlinearity 
can also be lost at these points, leading to nonconvexity. 

If bx^O but b^ = bl + bl= b\ is nonzero, so the magnetic field is com- 
pletely in the transverse direction, then the resistance to transverse motion 
described above disappears. We then have 

cj = -^b\ , Cs = ca = 0 . 

In this case the Alfven waves, slow waves, and contact discontinuity all col- 
lapse into a contact discontinuity and shear waves as in the Euler equations. 
The fast magnetosonic wave is similar to an acoustic wave but the transverse 
magnetic field affects the wave speed and structure. If we also let = 0, i.e., 
impose no magnetic field, then c; = a and we recover the ordinary acoustic 
waves. This is the hydrodynamic limit in which MHD reduces to the Euler 
equations. 

On the other hand, \{ b± = 0 while bx ^ 0, so that the magnetic field 
is entirely in the x-direction, then we expect two sets of Alfven waves and 
only one set of nonlinear waves. These should in fact be ordinary acoustic 
waves, which are unaffected by the magnetic field when B is aligned with the 
direction of compression. We can check this by computing the wave speeds 
from (8.13). If = 0 then 6^ = b^ and so 

c/,s = \{a^+f>l± \Z(a^ - 6?)^) ■ 

Now there are two cases to consider depending on how the Alfven speed 
Ca = 1 6a; I is related to the sound speed a, i.e., how strong the magnetic field 
is. 

If 6 _l = 0 and < a^, then 

Cj — (X ^ Cg ~ Ca — |6x| . 

In this case the slow magnetosonic wave degenerates to an Alfven wave, while 
the fast wave propagates at the acoustic speed, and is in fact an ordinary 
acoustic wave. Again note that if we also set 6^ = 0 we recover the structure 
of the Euler solution. 

If 6j_ = 0 and b\ > then 

Cj — Ca — |6x I , Cg — CL . 

In this case the slow wave is an ordinary acoustic wave, while the fast wave 
becomes an Alfven wave. 

The most interesting case arises when 6x = 0 and b\ = a^, in which case 

Cj — Cg — . 
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The two-dimensional families of Alfven waves propagate at the same speed 
as the acoustic waves. This is called the triple umbilic point (see Sect, 8.4). 



8.3 Solving the Riemann Problem 

The eigenvalues and eigenvectors of the Jacobian matrix derived above are 
useful in studying the propagation of small-amplitude disturbances about 
some fixed state. Solving the Riemann problem with a large-amplitude dis- 
continuity between the states is a different matter, however. For a strictly 
hyperbolic system, such as the Euler equations, we expect the solution to the 
m X m Riemann problem to consist of m waves, one corresponding to each 
eigenvalue of the linearized matrix. For the 2x2 isothermal system studied 
in Sect. 3.3, for example, the eigenvalues are = u - a, = u -h a, and the 
solution to the general Riemann problem consists of two waves that are easily 
identified with these two families. At each point in the relevant part of phase 
space (^ > 0) there are two distinct eigendirections and two distinct curves 
corresponding to 1- waves and 2- waves, determined by using the entropy con- 
dition to select the relevant portions of the integral curves and Hugoniot loci. 
These curves form a coordinate system for this portion of phase space, so 
that any two states can be connected in a unique way by a 1-wave followed 
by a 2- wave. A nonstrictly hyperbolic system can have a richer structure in 
solutions to the Riemann problem. 

8.4 Nonstrict Hyperbolicity 

If a system is not strictly hyperbolic then there are at least some points q in 
phase space where two or more eigenvalues of the Jacobian matrix are equal. 
If the problem is hyperbolic, then the matrix must still be diagonalizable and 
hence an eigenvalue of multiplicity r must have an r-dimensional eigenspace 
associated with it. At such a point there are infinitely many eigen-directions 
in phase space, leading to a possible breakdown in the structure mentioned 
above. 

The multi-dimensional Euler equations fail to be strictly hyperbolic ev- 
erywhere in phase space since A = u is an eigenvalue of multiplicity iV in 
space dimensions. The entropy wave and shear waves all travel at the fluid 
velocity. However, these are all linearly degenerate waves that behave com- 
pletely independently of one another and no complication is introduced by 
this. 

For nonlinear fields a multi-dimensional eigenspace can result in the num- 
ber of integral curves through such a point being different from the m we 
normally expect for an m x m system at a strictly hyperbolic point. This 
singularity in the vector field can lead to a global change in the structure of 
the integral curves and Hugoniot loci, and difficulties in solving the Riemann 
problem. 
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In some systems there are isolated points in phase space where strict 
hyperbolicity fails, called umbilic points [207]. In systems such as MHD 
there are curves or surfaces in phase space along which there are multiple 
eigenvalues, e.g., the surface = 0 or b± = 0. 

A simple 2x2 nonstrictly hyperbolic system illustrates some of the po- 
tential difficulties. The model problem discussed here is a special case of a 
system studied by Schaeffer and Shearer [207]. Brio and Rosenau[43] devel- 
oped a 3 X 3 extension of this as a model of intermediate waves and the triple 
umbilic point in MHD equations, which has also been studied by Myong [173]. 

Consider the 2x2 system 

+ + = 0, (8.15) 

Vt + {uv)x = 0 . 

The Jacobian matrix is 

fiq) = [ ^ ^ 

with eigenvalues u±v. For u ^ 0 the systems is strictly hyperbolic and has 
distinct eigen-directions (1,1) and (1,-1). Along the u-axis (where u = 0) 
in phase space the eigenvalues are both equal to u, the system fails to be 
strictly hyperbolic, and every vector is an eigenvector of f\q) = uL 

For the simple system (8.15), the integral curves and Hugoniot loci can 
be shown to agree, and are simply the lines with slope ±1 in the phase plane. 
These appear to behave well as we cross the u-axis. However, note that for 
u > 0 we have u + u > u — u, while for u < 0 we have u -h u < u — u, so our 
convention that forces us to define 

ifu<0 ifu<0 

— u ifu>0 ’ ifu>0 

with freedom in the labeling along u = 0. As a result, each integral curve 
takes a 90 degree turn as we cross the u-axis. This can happen only because 
every direction is an eigendirection along the u-axis. 

Now consider a Riemann problem with states on opposite sides of the 
u-axis, say qi — (o:,o:), qr = (i9,y3) with a > 0 > /3. The Rankine-Hugoniot 
jump conditions are satisfied between these states if we take s = a — (3. This 
jump, propagating at speed s, is a weak solution to the conservation laws. 
It is not so clear, however, whether it should be associated with 1-waves or 
2- waves, or whether such a wave is physically realizable. 

The situation with MHD is similar, though more complicated, and has a 
rich structure of “intermediate shocks” that are not easily identified with a 
particular family. In some cases the number of characteristics impinging on 
such a shock as time advances in the x-t plane is different from the m -h 1 
characteristics that are expected in the m x m strictly hyperbolic case (see 
Sect. 3.3.4), and the shocks are said to be overcompressive or undercompres- 
sive if the number is greater or less than the number we expect. 
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This is related to the fact that MHD is further complicated by a failure of 
genuine nonlinearity (Sect. 3.3.3) near points where strict hyperbolicity fails, 
leading to an even richer structure of compound waves that involve both shock 
and rarefaction waves in the same family traveling together. The existence of 
such waves was first discovered computationally by Brio and Wu[44], [259] 
who used one of the earliest Riemann-based methods for MHD, 

The analysis of intermediate waves in MHD is currently an active area of 
research, and there is still some controversy about what the correct admis- 
sibility criteria are for such waves, and which of these waves are physically 
observable. In this analysis one must remember that the ideal MHD equa- 
tions ignore a variety of physical effects, and the hyperbolic system is only an 
approximation to more correct systems of equations. In the Euler equations 
for an ideal gas, fluid viscosity and heat conduction are ignored. In MHD 
we have also ignored the dissipative effect of resistivity, and also dispersive 
effects due to the Hall current. 

There is a large literature on the topic of nonstrictly hyperbolic systems, 
for the MHD equations, in other applications such as oil recovery and non- 
linear elasticity where similar problems arise, and in more general mathemat- 
ical investigations. Along with the papers mentioned above, a small sample 
of interesting recent work includes [17], [42], [43], [91], [92], [114], [127], 
[154], [173], [207], [208], [209], [229], [261], [262]. Zachary, Malagoli, and 
Colella [265] discuss a numerical approach to dealing with nonstrict hyper- 
bolicity, based on ideas in [21], 

8.4.1 Artificial Resistivity. We obtain a hyperbolic problem in MHD by 
assuming the fluid is perfectly conducting, i.e., we ignore the dissipative effect 
of resistivity along with viscosity and heat conduction. The resulting hyper- 
bolic model is only useful if weak solutions obtained are in fact “vanishing 
resistivity” solutions to the more correct equations. For MHD the exact form 
of dissipation added can affect the wave structure. Numerical methods add 
some numerical resistivity and one might worry about whether the phys- 
ically correct solutions are obtained on problems where physical resistivity 
is important. The analysis of the correct structure is complicated by the 
loss of strict hyperbolicity and genuine nonlinearity, and some of the papers 
mentioned in the previous section discuss issues of correct viscosity criteria. 



8.5 Stiffness 

In some contexts the MHD equations are stiff in the sense that there are 
widely varying wave speeds in different paxts of the computational domain. 
In modeling the solar corona, for example, the density varies exponentially 
and so the waves speeds, which are proportional to 1/^^, will be orders of 
magnitude larger in some regions than others. If we use an explicit method 
with the same size time step everywhere, then the time step will be limited by 
the largest wave speed over the entire domain. In other parts of the domain 
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we will then be using a tiny Courant number. This is not only inefficient, but 
can lead to loss of accuracy in these regions. For such problems it may be 
necessary to develop an implicit method for solving the hyperbolic system, 
eg., [123], [151], [210], 



8.6 The Divergence of B 

Numerical solutions to the MHD equations may not satisfy the condition (8.5) 
that is automatically satisfied by the true solution. It should be maintained 
numerically to the level of the truncation error, but this can still have a 
severe impact on the numerical solution, as observed in early numerical work 
by Brackbill and Barnes [40]. A variety of approaches have been suggested to 
deal with this problem. 

One common approach (e.g., [191], [264]) is to use some form of diver- 
gence cleaning, which consists of projecting the numerical B field onto the 
space of divergence-free vector fields every few time steps, thus removing any 
nonzero divergence from the field. 

An arbitrary vector field V can be decomposed as the sum of a gradient 
and a curl, 

V = V0 + V X W (8.16) 

for some 0 and W. If we have such an orthogonal decomposition then V — V<j) 
has zero divergence and is the orthogonal projection of V onto the space 
of divergence-free vector fields, which are also called solenoidal This is the 
Hodge projection. We can compute (j) by taking the divergence of (8.16), 
which results in a Poisson problem for (j), 

V ■ V = . (8.17) 

The numerical algorithm for MHD with divergence-cleaning proceeds as 
follows. Given at the start of a time step, which includes magnetic field 
components B^ that are divergence- free, first update this by solving the 
hyperbolic system over the time step At, resulting in some new values Q*. 
This includes the updated field B* which will not in general be divergence- 
free. Now solve the elliptic equation 

= V-H" (8.18) 

for and then set 

and use these values in place of to define from Q*. In practice this 

projection may be done every few time steps rather than every step, since 
solving the elliptic equation can be expensive. Note that multi-dimensional 
elliptic equations are typically solved with iterative methods (see Sect. 1.3.2) 
which require an initial guess at the solution . We should take advantage 
of the fact that should be close to </>”, so that the solution from the 
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previous time step (or whenever divergence-cleaning was last performed) can 
be used as a good starting guess. 

This elliptic equation must be solved together with some boundary con- 
ditions for (j) on the boundary of the computational domain. Specifying ap- 
propriate boundary conditions can be a difficulty and careful thought must 
be given to this issue depending on the specific problem being solved. In 
studying fundamental phenomena, such as the behavior of certain instabil- 
ities in MHD (e.g., [203]), it may be possible to solve a simplified problem 
with periodic boundary conditions. In this case periodic boundary con- 
ditions could be used for (p which also allows fast Poisson solvers based on 
fast Fourier transforms to be used in place of iterative methods [233]. 

8.6.1 Incompressible Flow. Similar projection methods are commonly 
used in numerical algorithms for the incompressible Navier-Stokes equations, 
which are frequently used in studying the flow of liquids. Hence this literature 
is a good source of information on such methods. See [185], for example, for 
some references. A brief discussion of incompressible flow is worth a detour 
here since this also introduces some other new ideas that may be useful in 
astrophysical flow calculations. 

For an incompressible fluid the density g is constant and the continuity 
equation -f V • (^u) = 0 becomes 

Vu = 0. (8,19) 

Hence the velocity field must be divergence- free. The conservation of momen- 
tum equation has the form 

Ut + u • Vu -h Vp = , (8.20) 

where p is the viscosity (which typically cannot be ignored in modeling the 
flow of liquids). The incompressible Navier-Stokes equations (INS) con- 
sist of (8.20) together with the constraint (8.19). There is no energy equation 
and no equation of state. Instead the pressure p must be determined in each 
time step in such a way that (8.19) remains satisfied. There is no local evo- 
lution equation for p; it must be determined using this global constraint. 

One common approach is the projection method introduced in [51] 
(see also, e.g., [23], [20], [185]). In each time step one first advances (8.20) 
without the Vp term to obtain w*, i.e., we solve 

ut u ' Vu = (8.21) 

over time At, The resulting u* will not be divergence-free because of the 
nonlinear terms. 

Projecting u* onto the divergence-free space gives We solve 

Z\^V = V • u* 
and then set 

^ w* - AtVp^-^^ . 



( 8 . 22 ) 
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The scalar factor At is introduced into the potential function (0 = Atp) so 
that (8.22) can be viewed as an approximation to the equation 

Ut -h Vp = 0 . 

Solving this equation after solving (8.21) can be viewed as a fractional step 
approach (Sect. 5.) to the full INS equations, and the p obtained in this 
manner approximates the pressure. 

Note that for INS it is not fundamentally unphysical to have a velocity 
field that fails to satisfy (8.19) — such velocity fields arise in compressible 
fiow all the time and a real liquid is slightly compressible. One could model 
a liquid more properly by using the compressible Navier-Stokes equations 
with a suitable equation of state. However, for many applications the acoustic 
waves that would then arise are of no interest and all the motion of interest 
is on the much slower fluid time scale. Using the compressible equations 
would require taking much smaller time steps than desirable to resolve the 
uninteresting acoustic waves — again a problem of stiffness. 

While the individual acoustic waves may not be of interest, their net ef- 
fect is important. It is the small amplitude, rapidly moving, acoustic waves 
that mediate pressure changes throughout the liquid and cause a force ex- 
erted at one point in the fluid to be felt nearly instantly (relative to the 
fluid time scale) far away. In the incompressible equations we have filtered 
out the acoustic waves, but maintained their effect by introducing an elliptic 
equation for the pressure. This can be viewed as imposing a global ‘^equation 
of state” which requires that the fluid behave incompressibly. The elliptic 
equation couples all points together and hence allows information to propa- 
gate infinitely quickly throughout the entire domain. The pressure can also 
be viewed formally as a Lagrange multiplier which imposes the constraint 
(8.19) in a variational formulation of the INS equations. 

In the case of the MHD equations, there does not appear to be an anal- 
ogous interpretation of the divergence-cleaning procedure, since in theory 
V • JB = 0 should be exactly satisfied. The numerical method, however, may 
introduce nonphysical magnetic monopoles which require a nonphysical po- 
tential (j) for their removal. 

8.7 Riemann Problems in Multi-dimensional MHD 

As discussed in Sect. 8.2, the one-dimensional MHD equations contain the 
equation -§iBx = 0, which can be eliminated to give the standard system of 
seven equations (8.10) with seven physical waves. In a true one-dimensional 
plane wave there can also be no spatial variations in Bx^ so it is reasonable 
to drop this variable and the corresponding equation. 

However, in applying a multi-dimensional numerical algorithm of the type 
described in Sect. 6., we must solve one-dimensional Riemann problems at 
each cell interface with data which comes from the two cell averages on either 
side. We wish to solve the one-dimensional MHD equations in the direction 
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n normal to this interface, but there is no reason that the two cell averages 
should have the same value of B • n, when these values come from multi- 
dimensional data. 

Hence at the cell interface we have two sets of data qi and qr for the one- 
dimensional Riemann problem that do not have the same value for the normal 
component of the magnetic field, say for x-sweeps. The one-dimensional 
Riemann problem does not formally have a solution, then, as a strictly one- 
dimensional problem. The data does not satisfy V • .B = 0 but rather has a 
singularity along the interface. To compute a solution using the 7- wave one- 
dimensional solution described above, we must first decide on some average 
value of Bx to use as the parameter in this solution. 

Typically in practice an approximate Riemann solver is used, such as a 
Roe solver (Sect. 4.6.4) which is based on the eigenstructure of an approxi- 
mate Jacobian matrix, determined by some averaged state based on qi and 
qr^ In this case it is quite natural to think of averaging the two values of Bx 
in the definition of this matrix along with the other variables. 

Brio and Wu [44] discuss a Roe solver for MHD in the case 7 = 2 which 
is based on -weighted averages as in the Euler equations. For 'y ^ 2 they 
used a simpler averaging since their approach did not appear to yield an 
average Jacobian satisfying all the conditions of a Roe matrix in the general 
case. Other approximate Riemann solvers for MHD are discussed in [11], 
[15], [64], 



8.8 Staggered Grids 

Another approach to maintaining the divergence-free condition in MHD is 
to use a staggered grid in which the magnetic field variables are stored at 
cell edges rather than cell centers. In two dimensions, for example, the values 
of Bx would be stored at interfaces in the x-direction while By is stored at 
interfaces in the y-direction. Denote these values by 

^x,i±l/2,j H'Ud By ij^ij2 . 

Then the discrete divergence-free condition amounts to requiring that 

- Bx^i-i/2j) + “ ^y,tj-i/2) = 0 . (8.23) 

A variety of algorithms have been developed that update these edge values 
in a manner that exactly preserves the condition (8.23) from one time step 
to the next. See, for example, [63], [69], [228]. 

A similar staggered-grid approach is often used in incompressible flow, 
where the velocities Wi±i/ 2 ,j and Vij±ij 2 are maintained at cell edges while 
the pressure is a cell-centered value. This is often called a MAC grid, since an 
early application of this approach was in the Marker-and-Cell method [102]. 
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8.9 The 8- Wave Riemann Solver 



The standard derivation of the MHD equations uses the fact that V • B = 0 
to eliminate certain terms along the way, as in dropping the last term of 
(8.7), for example. If one does not make this assumption at the start, then it 
is possible to derive more general equations of the form 
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in place of (8.9). A derivation due to M. Vinokur is given in [186]. This looks 
like the MHD equations with a source term on the right-hand side which 
vanishes if V • 5 == 0. These equations can be rewritten as a quasilinear 
system 



qt + A^{q)q^ + Ay{q)qy + A‘{q)q^ = 0 . 

The matrix A^{q) has essentially the same structure as the Jacobian matrix 
f'{q) from the conservative form, but with the column corresponding to 
modified in such a way that the eigenvalues consist of the seven values from 
(8.12) together with A = u in place of the eighth eigenvalue A = 0 of f'{q). 
The quasilinear equation hcts Galilean invariance and the eighth characteristic 
field can be interpreted as an advection equation for V-B. This modification 
lo I\q) was developed by Powell [187] and advocated as an approach to 
controlling the accumulation of errors in V • B, at least for problems with 
open boundaries so that any V • B produced by the numerical method can 
advect out of the domain instead of remaing stationary. This approach has 
also been used in conjunction with occasional divergence cleaning. See [8], 
[187], [188], [186] for more discussion of this approach. 



9. Relativistic Hydrodynamics 

In modeling flow at speeds where relativistic effects become important, space 
and time become intrinsically coupled and the Euler equations of gas dy- 
namics become more complicated. Nonetheless, it is still possible to write 
the relativistic Euler equations as a first-order hyperbolic system that can be 
advanced forward in time in some fixed reference frame. The reference frame 
will be called the lab frame since this is typically the frame from which we 
are observing. 

Here we will only consider the simplest case of flat spacetime. To compute 
flows near massive objects it would be necessary to modify the equations 
further to include the local curvature of spacetime. This can be done without 
essential new difficulties arising by including the appropriate metric tensor in 
place of the Minkowski metric rj below, provided the geometry of spacetime 
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is assumed to be fixed or to vary in some known manner. This is what is 
generally refered to as relativistic hydrodynamics (RHD). 

To model the evolution of spacetime as mass is redistributed requires the 
Einstein equations of general relativity, which are significantly more difficult 
to solve (see Sect. 9.7). 

9.1 Conservation Laws in Spacetime 

There is a symmetry between space and time in any system of conservation 
laws such as (1.7) or (2.27) that may not be apparent from the derivation of 
Sect. 2., but which is worth observing when starting to think about relativistic 
flows. The three-dimensional continuity equation for the conservation of mass, 
for example, can be written as 

V-Fig)=0, (9.1) 

where 
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If we call F{g) the 4-flux of then this equation states that the divergence 
of the 4-flux is zero. The flux functions gu, gv, and gw represent the flux of 
g through surfaces with constant x, y and z, respectively. So we see that g 
itself can be thought of as the flux of g through constant t. 

If we let i? be an arbitrary 4-dimensional volume in ordinary spacetime, 
then (9.1) results from the requirement that integrating the normal 4-flux 
over the boundary of O gives zero. Equation (2.28) is a special case of this 
where the volume is a 4-cube. In (2.32) we generalized this to a “cylinder” in 
time with arbitrary cross-section f?, but we now recognize that this can be 
generalized further to any arbitrary 4-dimensional volume. 

Figure 9.1 illustrates the concept of flux-in-time for the continuity equa- 
tion in one space dimension. Here the spacetime volume is taken to be a 
rectangle. Several particle paths are drawn. Suppose the particles each have 
unit mass. Then the mass flux f = gu can be interpreted as the number of 
particles crossing a fixed point x per unit time, and the integral of this flux 
over the left or right side of the rectangle in Fig. 9.1 gives the flux through 
that side, which is simply the number of particles crossing it. 

The density g is the number of particles per unit length at a fixed time, 
and hence integrating the density over the top or bottom of the rectangle 
simply gives the number of particles crossing that side. Hence we see that 
the density function can be interpreted as mass-flux through a constant-time 
surface. 

The conservation law dtg-\-dxf = 0 says that the divergence of the space- 
time flux is zero, which simply reflects the fact that particles are conserved 
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Fig. 9.1. Particle paths through a spacetime volume 

— the same number of particles must be leaving the rectangle in Fig. 9.1 as 
are entering it. 

Furthermore, we can combine the continuity equation with the conserva- 
tion of momentum equations by introducing the tensor 

Q QU QV QW 

Qu QV? -f p guv guw 
gv guv Qv’^ -h p gvw 
gw guw gvw gw^ 

This is an extension of the stress tensor (2.34) which includes density and the 
mass fluxes, so it might be called the stress-mass tensor. The conservation 
laws for mass and momentum can then be summarized by stating that the 4- 
divergence of M must vanish. This tensorial viewpoint might be valuable if we 
wanted to use an arbitrary curvilinear grid in spacetime, as the conservation 
laws would then be given by requiring that the covariant derivative of M 
be zero. Ordinarily a curvilinear grid is only introduced in space, however, 
combined with time stepping in the usual way. The tensorial nature of the 
stress tensor (9.2) is important in transforming the momentum equations to 
an arbitrary spatial grid, but the tensor M is generally not introduced (but 
see Section 5.10 of [167]). 

At relativistic velocities, however, the coupling between space and time 
requires that we extend the ordinary momentum to the 4-momentum, which 
includes a temporal component and introduces a 4 x 4 tensor for its conser- 
vation. It is not, however, the continuity equation that gets incorporated into 
this tensor, as one might expect from the above discussion. Instead it is the 
conservation of energy equation that is used, and the resulting tensor T is 
called the stress-energy tensor. However, because of the interchange between 
mass and energy in relativity theory, this is related to conservation of mass. 
The continuity equation stands alone and enforces the conservation of par- 
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tides, though the perceived partitioning between mass cuid energy in each 
particle depends on its relative speed. 

9.2 The Continuity Equation 

The relativistic Euler equations will be briefly derived using the notation of 
[162], for example. We first extend the continuity equation to the relativistic 
case. It is still true that particles are conserved, just as in Fig. 9,1. This is 
illustrated in Fig. 9.2 for a flow at relativistic speed u hitting a wall that is 
stationajy in the lab frame (the x-t axes). The x'-ct' axes show the Lorentz 
frame of the moving gas. A relativistic shock wave from the wall decelerates 
the gas to zero velocity in the lab frame. Several particle paths are shown 
and again the net number of particles passing through the surface of any 
spacetime rectangle must be zero. Now, however, the perceived mass of each 
particle changes with its velocity, so it is not true that the net mass flux, as 
measured in the lab frame, will be zero. If mo is the rest mass of a particle, 
then the perceived mass is Wmo, where 



- (u/c)^ 

is the Lorentz factor. If is the rest density of the gas, i.e., mass per unit 
distance in the local rest frame of the gas at each point, then the density 
measured in the lab frame (perceived mass per unit distance along the x~ 
axis) is 




Fig. 9.2. Minkowski diagram for relativistic flow hitting a wall, generating a shock 
wave which brings the gas to a rest in the lab frame. Particles are still conserved 
when integrating over any spacetime region, such as the dashed rectangle 
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D = Wgo . 

Then the continuity equation for the relativistic gas can be written as 

Dt + {Du)^ = 0 . (9.3) 

Note that W depends on u (and hence varies with x and t in general) so that 
the factor W cannot be cancelled out of this equation. It is not true that 
(eo)t + (Qou)x = 0. 

9.3 The 4-Momentum of a Particle 

We now study the generalization of the momentum to the 4-momentum. First 
consider a single particle moving along an ar bit ary path in spacetime. Denote 
a point in spacetime, and the resulting covariant and contravariant gradient 
operators, by 




respectively, with the convention = ct (and as usual Greek indices range 
from 0 to 3 while latin indices range from 1 to 3). We use the Minkowski 
metric 

■ -1 

• (9.4) 

1 

We can then parametrize the particle path by 
x(t) , 

where i is proper time, i.e., time as measured by the particle. It is then natural 
to define the 4-velocity of the particle as 

dx{t) 

U — . 

dt 

In the lab frame we have dt = ^^dt and hence 




The 4-momentum of the particle is then defined as 
P = mou , 

where mo is the rest mass. The O’th component of this is 
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po = Wmoc = WEo/c , 

where 

Eo = moc^ 

is the rest energy of the particle. 

9.4 The Stress-Energy Tensor 

Generalizing the stress-mass tensor M of (9.2) to the relativistic stress- 
energy tensor is complicated by the need to properly model pressure. (Ig- 
noring pressure gives “relativistic dust” as discussed in [211].) In the nonrel- 
ativistic Euler equations, the energy for an ideal gas is defined by 

r. 1 2 

E= ge^ -gvT , 

where e is the specific internal energy. Note that 

E+p = gie + p/g) + = gh+]^gv? , 

where h = e+pj g is the specific enthalpy. The energy flux u{E-hp) contains 
both an advective term uE and a term up that corresponds to work done 
compressing the gas. 

In the relativistic case, we must incorporate the relativistic enthalpy into 
the momentum in order to obtain a tensor conservation law. The relativistic 
specific enthalpy is defined by 

h = + € -hp/go . 

Note that 

goh = goc^ go^+P 

is the ordinary enthalpy plus a new term corresponding to the rest-mass 
energy which must be included with the internal energy in the relativistic 
case. In some works (e.g., [14]) the quantity h/c? is used instead, and is 
called the mass- equivalent enthalpy density. 

The stress-energy tensor then takes the form 

= {gQhlc^)u^u^ -f , (9.5) 

where 7]^^ is the Minkowski metric. With this definition of T, the conservation 
laws for momentum and energy take the desired form, and simply state that 
the divergence of this tensor must vanish: 

= 0 . 

In the lab frame we have 

TOO 

= W{c^D-\-De + Dp/go) ~p 



(9.6) 




138 R.J. LeVeque 



while the momentum is defined by 
= 5 ' = QohW^uyc 
and the stress-energy tensor is 
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In one space dimension, for example, we have the conservation laws 

si -f {S^v} -\-p)x — 0 (9*8) 

Et + Sl = 0 (9.9) 



together with the continuity equation (9.3). 

9.4.1 Energy Perturbation. Computationally, the fact that the energy 
E contains the rest-mass energy can cause difficulties, since this term can 
overwhelm the internal energy due to the presence of the factor. This can 
lead to a loss of all accuracy on a finite-precision computer. For this reason 
it is necessary to solve for the energy perturbation rather than the energy 
itself. Note that expanding 

W = = 1 + + 0{{ulcY) 

y/\ - {u/cy i 

in (9.6) gives 

E = (l + 1(7)' + • • •) + Evhn -P (9 10 ) 

= c^D -b -hDe-h(W~ l)p -f 0((u/c)2) . 

In the nonrelativistic limit (u/c 0 and IT — )• 1, po), we see that this 
approaches 

E - QoC^ + ' 

This gives the usual kinetic and internal energy, plus the rest-mass energy. 
(Note that kinetic energy arises from the rest-mass energy and the expansion 
of IT.) The expansion (9.10) suggests that computationally the appropriate 
energy perturbation to consider is thus 

t^E-c^D . 



Subtracting c times the continuity equation (9.3) from the conservation of 
energy equation (9.9) then gives the equation 

-Tt -f- (5^ — cDu^)x = 0 . 
c 

adiapThis equation, together with the continuity equation (9.3) and the con- 
servation of momentum equation (9.8), gives the relativistic Euler equa- 
tions in one dimension: 
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Dt^‘{Dv})^ = 0 , 

5^ + {S^u^ + cp)x = 0 , 

Tt + {cS^ - c^Du^ )x = 0 . 

In the literature, units are often chosen so that c 
these equations. 

9.5 Finite Volume Methods 

The finite volume methods discussed in Sections 4,2, 4,5, and 4.7 can be ex- 
tended to the relativistic case fairly easily. Solving the one-dimensional Rie- 
mann problem at each cell interface can be done in the same way as outlined 
in Sect. 3,4 for the Newtonian case, with a few minor changes. Taub [236] first 
developed the theory of simple waves and shocks for relativistic flow. Further 
work was done by [116], [150], [241], and other researchers. The complete 
solution to the relativistic Riemann problem is clearly presented by Marti 
and Muller [160], An all-shock approximate Riemann solver (see Sect, 4,6,1) 
was developed by Balsara[14]. See also [66], 

The basic approach is the same as in Sect, 3.4, once the functions {p * ; qi) 

and U^{p*\qr) have been appropriately modified. Figure 9,3 shows the solu- 
tion to the Riemann problem in the u-p plane in the relativistic case, for a 
case where the initial velocities are zero but there is an enormous pressure 
jump that gives rise to motion at relativistic speeds. This figure also shows 
the solution that would be obtained with the nonrelativistic Riemann solver. 
Note that this would give a superluminal velocity in the intermediate state! 

One additional complication in the relativistic case is that it is more 
difficult to switch from the conserved quantities (T>,5, r) to the primitive 
variables {g.u^p) needed in this solution process. In the Newtonian case it is 
trivial to compute u from g and gu^ for example, but in the relativistic case 
we have 

D = qW , 5 = QhW'^u/c 

and the value of W depends on u, while the enthalpy h also depends on the 
internal energy e and the pressure. Hence we must solve a nonlinear system 
of equations to recover the primitive variables from the conserved quantities, 
as described in [160], 

Figure 9.4 shows a relativistic shock tube computation performed by 
D. Bale with the one-dimensional CLAWPACK code. The initial data is taken 
from Problem 2 of [162], and the results agree favorably with what is shown 
in Fig, 4 of that paper, as computed with a relativistic PPM method on the 
same grid (400 points). The superbee limiter has been used in this computa- 
tion. The structure of this Riemann solution is similar to what was seen in 
Fig. 2.2 for a nonrelativistic flow with analogous initial data (constant den- 
sity but higher temperature on the left). Again a thin zone of high-density 
gas is swept up by the leading shock. In the relativistic case, however, this 
structure can be exceedingly thin, leading to computational challenges. 



(9.11) 



= 1 and does not appear in 
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Fig. 9.3. Comparison of Hugoniot adiabats for the flow that is assumed to be non- 
relativistic (dashed lines) with the Taub adiabats of relativistic flow (solid lines) . S 
and TZ refer to shock and rarefaction curves and the superscript sr refers to the spe- 
cial relativistic case. With data qi and Qr the nonrelativistic Riemajin solver would 
predict the state q* in solving the Riemann problem, which has a superluminal 
velocity. The correct intermediate state in the solution to this relativistic Riemann 
problem is denoted by q ^^ . Note that u is in units of c while the pressure is in units 
of 



It is easy to see why this structure should be very thin at ultrarelativistic 
speeds. The high temperature gas creates a shock moving into the cooler gas 
which accelerates this gas to a very high velocity v. The contact discontinuity 
moves at this fluid velocity. The shock is moving at some greater velocity s, 
but must still be subluminal, and so < 5 < c. As we raise the temperature 
on the left, v ^ c and s must lie in between, so from the lab frame it appears 
that the shock and contact are moving at nearly the same speed, with a very 
thin zone in between. (From the reference frame of the gas in this zone, on 
the other hand, the shock would be moving ahead at enormous speed and 
this zone would appear to widen very rapidly.) 

Note that in this Riemann problem the initial velocity is zero everywhere 
but the enormous left-state pressure gives rise to macroscopic flow at relativis- 
tic speeds. But even the left-state gas which is still “at rest” is in fact moving 
microscopically at relativistic speeds; it must be to create such pressure. Such 
a gas is refered to as “relativistically hot” . By contreist a “cold” relativistic 
gas is one which is moving at relativistic speed in the chosen reference frame, 
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Fig. 9.4. Relativistic shock tube computation with initial states = 

(1,0,1000.0) and (^r,^r,Pr) = (1,0,0.01). Velocity is in units of c and pressure 
in units of c^, while the temperature plotted is p/g. The solid line shows the true 
solution 



but for which there exists a reference frame in which all the molecules are 
moving at nonrelativistic speeds. (See also Section 5.5 of Muller’s notes.) 

9.6 Multi-dimensional Relativistic Flow 

A further complication arises in multi-dimensional flow. For a multi-dimen- 
sional finite volume method we must solve a one-dimensional Riemann prob- 
lem normal to each interface between grid cells. For the Newtonian Euler 
equations this is easily done with a trivial extension of the one-dimensional 
Riemann solver. The structure of the solution is completely determined by 
the density, pressure, and normal velocity in the two neighboring grid cells. 
There are also transverse velocities defined in each grid cell, but these values 
have no effect on the solution to the one-dimensional problem. The solution 
to the three-dimensional problem consists of 5 waves, the 3 waves from the 
one-dimensional solution plus two shear waves that carry the jumps in the 
transverse velocities. These shear waves travel at the velocity u* of the con- 
tact discontinuity, and an arbitrary jump in transverse velocity is allowed 
across these waves. 








142 R.J. LeVeque 



At relativistic speeds, however, the magnitude of the transverse velocity 
does have an impact on the solution of the Riemann problem in the normal 
direction. This is because the Lorentz factor W depends on the total velocity, 

W — [l - {u^ + w^)/c^) , 

and so changing the transverse velocities affects the structure of the one- 
dimensional solution. The transverse velocities are constant across a shock or 
through a rarefaction fan in the Newtonian case, but are not constant at rel- 
ativistic speeds. For shock waves, the presence of transverse flow only compli- 
cates the algebraic expressions. For rarefaction waves, however, the solution 
process now requires solving a coupled system of two ordinary differential 
equations to integrate the two transverse velocities through the rarefaction 
fan [160].. Using the 2-shock approximation avoids this complication, 

9.7 Gravitation and General Relativity 

So far we have only considered the case of flat spacetime, using the Minkowski 
metric (9.4). In the neighborhood of massive objects such as black holes or 
neutron stars, it is necessary to take into account the curvature of spacetime 
using a more general metric g in place of g, with distance measured locally 
by 

ds^ = . 

9.7.1 Test Fluid Approximation. In the simplest case, we assume that 
this metric is known and we wish to solve the hydrodynamic equations for 
the motion of a fluid whose mass has negligible effect on the metric. This is 
sometimes called a test fluid. An example might be the study of accretion 
into a black hole, taking g to be the the radially symmetric Schwarzschild 
metric ([167], [211], [252]). 

In this case the hydrodynamics equations are modified simply by replacing 
the stress-energy tensor (9.5) by 

- (Qoh/c‘^)u^u^ + pg^^ , (9.12) 

and the conservation laws still take the form 
= 0 . 

When this is written out in the lab frame we obtain conservation laws similar 
to (9.11) (in one space dimension) but the equations now also have source 
terms which arise from the covariant differentiation of the metric term. 

An approximate Riemann solver for general-relativistic hydrodynamics 
was developed by [82]. Related methods are developed in [16], [201]. 
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9.7.2 The Einstein Equations. If we wish to study the motion of massive 
objects, say the collision of two black holes or neutron stars or the core 
collapse of a neutron star in the formation of a black hole, then we must 
determine the metric g as part of the solution process. Such events give 
rise to gravitational waves which propagate outward at the speed of light as 
ripples in the spacetime metric. Gravitational radiation of this form should 
be constantly bombarding the earth from interesting astronomical events. In 
principle this can be detected, for example by accurately measuring distances 
in two orthogonal directions and monitoring how the distances compare to 
one another. See [1], [238], [240] for surveys. See also [239] for an excellent 
introductory overview of this topic and others in gravitational astrophysics. 

Unfortunately, such events are sufficiently far away that oscillations in the 
metric observed on earth are expected to be of relative magnitude 10“^^ or 
less. Thus the entire earth distorts by on 10~^^ meter (10 times the diameter 
of an atomic nucleus), and laboratory instruments of more modest size have 
so far been unable to detect gravitational waves. However, there are cur- 
rently several highly-sensitive instruments under construction which may be 
capable of measuring these waves within a few years. For this reason, there 
is currently considerable interest among numerical relativists in predicting 
the gravitational wave signature of various possible events, in order to help 
interpret any data received and separate the signal from noise. 

Astronomy has been revolutionized in the past by developing the ability 
to observe electromagnetic radiation in new wavelength regimes (e.g., by X- 
ray and radio telescopes). If successful, the development of gravitational- wave 
observatories will surely lead to many new surprises. 

To compute the evolution of the metric, we must solve the Einstein equa- 
tions 

G = SttT (9.13) 

which are described in detail in many sources, e.g., [88], [167], [211], [252]. 

Here T is the stress-energy tensor (9.12) and G is the Einstein tensor, 
which is a function of g and its first and second derivatives which gives the 
proper measure of the curvature of spacetime. The structure of spacetime 
affects the hydrodynamics, just as in the previous section, but the hydrody- 
namics also affects the structure of spacetime when the motion of massive 
objects is being modeled. 

Since G and T are symmetric tensors, the Einstein equations (9.13) are a 
system of 10 equations in 4-dimensional spacetime. One might imagine trying 
to find solutions to these equations over some region of spacetime with given 
boundary conditions, but it is not clear what sort of boundary conditions are 
required since there should be causality in time-like directions. 

9.7.3 The 3-|-l Formalism. Instead of viewing (9.13) as a boundary-value 
problem over a region of 4-dimensional spacetime, the modern approach is 
to view spacetime as a foliation of 3-dimensional space-like manifolds with a 
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spatial metric 7 (obtained from g) defined on each sheet. The equations are 
then solved by marching forward in the time-like direction that links these 
manifolds together, advancing 7 forward in “time”. This approach was orig- 
inally proposed by Arnowitt, Deser, and Misner [6] and is also known as the 
ADM formalism. The equations they proposed have a Hamiltonian structure. 
More recently, other 3 + 1 formulations of the Einstein equations have been 
proposed that give first-order hyperbolic systems of conservation laws. These 
equations are amenable to solution by high-resolution finite-volume methods 
of the sort discussed in these lectures. 

The Einstein equations involve second derivatives of the metric g. In order 
to write these equations as a first-order hyperbolic system, it is necessary to 
introduce a larger set of variables corresponding to various derivatives or 
combinations of derivatives of g. There are several ways in which this can be 
done, and it is not clear at this time which formulation is best. The number of 
variables and equations varies between different formulations, ranging from 
30 to 58, so these systems are quite large. Some possibilities are discussed in 
[35], [36], [50], [93]. 

Hyperbolic formulations have received recent attention for theoretical as 
well as numerical work. The well-developed mathematical theory of hyper- 
bolic conservation laws can be used to gain new insights in general relativity. 
For example, Smoller and Temple [224] have found new solutions to the Ein- 
stein equations which allow the possibility that the visible universe may only 
locally resemble an expanding 3-sphere, being separated from the remainder 
of the universe by a shock wave. See also [222], [223]. 

9.7.4 Elliptic Constraints. In addition to the first-order hyperbolic equa- 
tions, there are also various constraint equations which should be satisfied 
between these variables. These constraints typically take the form of elliptic 
equations. In principle it is enough to impose these constraints at the initial 
time, known as the initialization problem (e.g., [170]). The time evolution of 
the Einstein equations guarantees that the constraints will be satisfied also 
at all later times. Numerically, however, they may fail to be satisfied at later 
times in the same way that the V • B = 0 condition in the MHD equations can 
fail numerically (see Sect. 8.6). It may be necessary to use similar projection 
procedures to maintain these constraints. In current practice, however, these 
constraints are often imposed only at the initial time in the process of prepar- 
ing the initial data. It is then hoped that an accurate numerical method will 
maintain them sufficiently well at later times, avoiding a costly elliptic solve 
in each time step. Choptuik [47] gives some justification for this for smooth 
solutions, where one can argue that the constraints should remain satisfied to 
the order of accuracy of the numerical method. For problems involving shock 
waves this will not generally be true. 

9.7.5 The Vacuum Einstein Equations. The simplest case of the Ein- 
stein equations arises from setting T = 0 so that the equations reduce to 
G — 0. This corresponds to assuming that there is no mass in the compu- 
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tationa] domain. This simplifies the equations since we then do not need to 
worry about the hydrodynamics and associated complications such as fluid 
dynamical shocks. Such an approach would be valid for modeling the inter- 
action of two black holes, for example, if the domain does not include the 
black holes themselves. 

As noted above, the first-order hyperbolic system that we obtain is quite 
large, with at least 30 variables and equations. Developing and using a Rie- 
mann solver for such a system would appear to be a daunting task. However, 
these systems typically decouple into sets of linear hyperbolic systems for 
which the eigenstructure can be relatively easily determined. 

9.7.6 Choice of Time Slicing. To solve the Einstein equations numeri- 
cally in the 3 -f 1 formulation, let represent the computational 

coordinates, so that we march forward in t from one spacelike manifold M{t) 
to the next Ai{t At). The goal is to construct the metric as we go. 
There is a great deal of flexibility in this construction since there are many 
different foliations of spacetime that we could choose, and many different 
choices of spatial coordinates within a given foliation. This flexibility must 
be pinned down and also exploited in order to avoid singularities. We may 
need to avoid physical singularities such as black holes, but we must also 
avoid artificial singularities that can arise even in computing flat spacetime 
from a poor choice of coordinates, as illustrated at the end of this section. 

The metric g is generally decomposed as 

ds^ = g^i,dx^dx^ 

= —a^dt^ + gij{dx^ -f ^^dt){dx^ + /3^dt) , 

where gij is the spatial part of the metric, a is the lapse and /?* the shift 
vector. The lapse measures the elapsed proper time between two manifolds 
M{t) and Ad(t H- At) (note that a, /?‘, and gij all vary with in general, 
so the lapse can be different at different locations). An observer starting at 
location x^ on M{t) who moves normal to this manifold (i.e., remains at rest 
in his comoving frame) for proper time adt, will find himself on the manifold 
M(t~\~ At) at location x^ + This shift in coordinate location is due only 
to a change in the labeling of coordinates from one manifold to the next, and 
defines the shift vector. 

The particular foliation of spacetime is determined by the choice of a 
together with the initial metric gij on Mifi) and its derivatives needed to ini- 
tialize q in the hyperbolic conservation law. In particular, the lapse a deter- 
mines how rapidly proper time advances in different regions of the simulation, 
and hence can be used to avoid certain physical singularities. Figure 9.5(a) 
shows a schematic diagram of a time slicing that might be useful in mod- 
eling gravitational collapse and the formation of a black hole. The physical 
singularity forms on the ^-axis at time t* and the time slicing is chosen in 
such a way that it is possible to compute out to larger times away from the 
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Fig. 9.5. Possible time slicing for gravitational collapse of matter (within the heavy 
curve) to a black hole along the i-axis. (a) Time is retarded to avoid the physical 
singularity by choosing a small lapse near the i-axis, (b) Horizon boundary condi- 
tions could also be used to avoid computing within the apparent horizon 



singularity by slowing down the progression of time (taking a smaller) near 
the origin. 

Another possibility for this particular calculation would be to use apparent 
horizon boundary conditions as discussed in [4] and [213]. Since information 
cannot escape from within the event horizon, the computational domain can 
be truncated at some point within the apparent horizon and nonreflecting 
outflow boundary conditions then used. 

Once the time-evolution of the lapse and shift have been specified, one 
obtains a hyperbolic system for certain derivatives of the spatial metric Qij . 

One simple choice is a = 1, = 0, which gives geodesic slicing. This 

would not be suitable if we want to slow down the progression of time as in 
the previous example. But even if physical singularities are not a problem, 
geodesic slicing often leads to the formation of singularities in the coordinate 
system. 

Figure 9.6 shows two examples of geodesic time slicing of flat Minkowski 
spacetime. In Fig. 9.6(a) the initial manifold was chosen to be a plane of si- 
multaneity for an observer at constant velocity. Geodesic time slicing leads to 
the coordinates shown in Fig. 9.6(a), which are simply Cartesian coordinates 
in this Lorentz frame. In Fig. 9.6(b), the initial manifold was chosen to be a 
curved spacelike manifold. Now geodesic time slicing results in a focusing of 
the coordinate lines into a caustic, or “coordinate shock” . 

The appropriate choice of time slicing is a major concern in numerical rel- 
ativity, and a variety of different approaches have been used. Some examples 
are given in [35], [36]. 

9.7.7 Adaptive Refinement. Some spherically symmetric problems in nu- 
merical relativity can be solved in 1 + 1 dimensions, such as spherical stellar 
collapse and black hole formation. Spherically symmetric solutions cannot 
give rise to gravitational radiation, however, and the most interesting prob- 
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Fig. 9.6. Two examples of geodesic time slicing of flat Minkowski spacetime, (a) 
The initial manifold is a plzine of simultaneity, (b) A different initial manifold leads 
to a coordinate singularity 



lems will ultimately require solving the full 3+1 equations. Given the size of 
the hyperbolic systems involved and the large domains required, this is a chal- 
lenging task for which adaptive mesh refinement, as discussed in Sect. 6.3.2, 
will almost surely be needed for efficient computation. 

Even for 1 + 1 calculations, adaptive refinement can be crucial. For exam- 
ple, Choptiuk[48], [49] discovered an interesting new effect via careful high- 
resolution adaptive numerical simulation. Argyres [5] gives a concise overview 
of this and related work. 

9.7.8 The Full Einstein Equations. If we wish to include the effect of 
mass in the Einstein equations with a time-dependent density function, then 
we must also solve the hydrodynamic equations for the fluid motion in curved 
spacetime in conjunction with advancing the metric that determines the ge- 
ometry. This would be necessary, for example, in modeling gravitational waves 
resulting from core collapse supernovae, as discussed in Section 3.8 of Muller’s 
notes. Presumably a full treatment would require coupling general relativity 
with magnetohydrodynamics, radiative transfer, nuclear burning, and a host 
of other physical effects. 

One possible approach is to use a fractional step method, in which we al- 
ternate between freezing the geometry and solving the hydrodynamics equa- 
tions as discussed in Sect. 9.7.1, and then freezing the mass distribution and 
using the Einstein equations to update the metric. Researchers are just be- 
ginning to explore such methods. Even with the on-going exponential growth 
of available computing power, it is clear that the further development of even 
more accurate and efficient numerical algorithms will play a crucial role. 
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1. Basic Radiation Theory 

References: [1, 5, 22, 29, 35]. General Reference: [24]. 

1.1 Specific Intensity 

Define the specific intensity such that 

6£ = I{x,t;n,i') dS cosadudt ( 1 . 1 ) 

is the radiant energy in a range {i/, v -f du) crossing dS, whose normal makes 
an angle a with n, the direction of propagation of the radiation, into a solid 
angle da;, in a time d^. The units of specific intensity are 

[/] = ergs cm”^ s"^ hz“^ sr“^. (1.2) 

In one-dimensional plane-parallel or spherically symmetric media I is inde- 
pendent of the azimuthal angle around the outward normal; it can depend 
only on one space coordinate {z or r) and on the polar angle 0 between the 
outward normal and the direction of photon propagation n. We can then 
write I = 7(z, t\ /x, i/) or /(r, t; /x, u) where /x = n- fc or /x = n-f. 



1.2 Photon Number Density 

I is appropriate for the macroscopic view. To think in terms of photons, define 
the photon number density xp such that xp{Xyt]n,i/) duduj is the number of 
photons, per unit volume, at (x, ^), with frequency (z/, ly + di/), traveling with 
speed c into solid angle da; around n. Then the number crossing area in 
time dt into do; is ?/; n * dS (cdt) duduj. Each photon has energy hi/, hence 

S£ = chu xp{x, t; n, u) dS cos a do; du dt . (1*3) 

Therefore 



/(x, t; 7i,u) = c hu xp{Xj t\ n, u) . 



(1.4) 
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1.3 Photon Distribution Function 



Define the photon distribution function such that /a(a:, t;n,p) is the 
number of photons, per unit volume, at (x, i), with momenta (p,p + dp) — 
(hi//c)(n, n + dn). Using d^p = p^dpdcj = {hjcYvAuAuj we find 

l(?)fYidv duj = V^di/dcj , (1.5) 



which implies 



/r 5 



and 



/ = . 



( 1 . 6 ) 



(1.7) 



/r is the direct analogue of the particle distribution function for material 
particles. By exactly the same analysis as used for material particles, one can 
show that /r is a relativistic invariant under Lorentz transformation. 



1.4 Mean Intensity 



The mean intensity is the average of the specific intensity over all solid angles, 

( 1 . 8 ) 



J(x, t\i') = ^ I{x, t] n, z/) du) . 



The units of the mean intensity are the same as the specific intensity. In 
one-dimensional planar or spherically-symmetric media, the integral over the 
azimuthal angle ^ is trivial (because I is independent of ^). Therefore using 
do) = sin & dO d^ = —dp d# we have 






ii/dp . 



(1.9) 



Thus, the mean intensity is the zeroth angular moment of the specific inten- 
sity. 



1.5 Radiation Energy Density 

The monochromatic radiation energy density at frequency v equals (the num- 
ber density of photons) x(/iz/), summed over all solid angles, that is 

E{x^ t\ i/) = hi/ ^ t\ n, v) dcj , (1-fO) 

which implies 

1 f 47T 

Ej, ~ - <b I{x,t\n,i/)d(jj — — Ju . (1-lf) 

c J c 

The total radiation density is E = Ei,di/ = (47t/c) J. 
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1.6 Radiation Energy Flux 



Define the monochromatic radiation energy flux F{x^ t] u) (= F,^) such that 
F • d5 is the net rate of radiant energy flow, per unit frequency interval, 
through dS. The number flux through dS is 



N — il){x,t]n,i/)cnduj 



dS 



( 1 . 12 ) 



To get the energy flux, multiply by hi/. Then convert to /, obtaining 



d£ 



<1 



I{x, t\ n, z/) 71 du; ) • dS 



(1.13) 



hence 

F{x, t;i/) = ^ /(x, t] n, i/)ndu; . (1-14) 

The units of are [Fj,] = ergs cm~^ s“^ hz“^ The total radiant energy flux 
is F = Ft, du. Note that in one-dimensional planar or spherical geometry, 
azimuth^ symmetry of I implies only one nonzero component of F (along 
}z in planar geometry and r in spherical geometry. This is easy to see by 
taking rays in cancelling pairs in the (x, y) plane. People often call this one 
component “the” flux as if it were a scalar: 



F(z, t\ 1 /) = 2n j /(z, t; /x, z/)/x d/x = F/ . 



(1.15) 



But don’t forget that F is really a vector! Following Eddington, it is useful 
to define 




1 

2 




z,^;/x,z/)^d/x . 



(1.16) 



Note that Hj, is the first angular moment of the specific intensity. 



1.7 Radiation Momentum Density 



The momentum of a photon traveling in direction n is {hv/c) n. Therefore 
the net rate of radiation momentum transport through dS is (1/c) times 
the energy flux through dS, that is, (l/c)Fj, •dS. Thus, the monochro- 
matic momentum flux is {1/c) Ft,. This flux is transported by particles 
moving with speed c. Therefore the monochromatic momentum density is 
(1/c) • momentum flux. 




(1.17) 



and the total radiation momentum density is G = (l/c^)F. The units of Gt, 
are [Gu] — g cm”^ s”^ hz“^. 
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1.8 Radiation Stress Tensor (Radiation Pressure Tensor) 

Definition, The radiation stress tensor is most easily derived in terms of the 
photon picture. Let = net rate of momentum transport in the ith direc- 
tion per unit area of a surface perpendicular to the jth coordinate axis. The 
number of photons of frequency ly moving in the jth direction and crossing a 
unit area per unit time is 'ijjt.cnK Each has momentum (/iiv/c)n^ in the ith 
direction. Thus, 



P^^{x,t]v) = ^ 


t\ n, u){hv ! c){cn^) du , 


(1.18) 


or 


c 


^ I{x, t\ n, i')n'^n^dij , 


(1.19) 


which implies 


P{x,t;i/) = 


/(x, i;n, u)nndLj . 


(1.20) 



P is the monochromatic radiation stress tensor. Note that P is manifestly 
symmetric, i.e., = P^\ The total radiation stress tensor is 

rOO 

P{x,t) — P^du . (1*21) 

Jo 

The units of P^ and P are: [P^^] dynes cm“^ hz“^ and [P] = dynes cm“^. 

One- Dimensional Media. In general one needs six distinct components to 
specify the (symmetric) radiation stress tensor in three dimensional media: 
three along the diagonal, and three off-diagonal. But for one dimensional 
symmetry (planar or spherical) there is substantial simplification. 

1) The axisymmetry of h in a one dimensional medium implies that the 
off-diagonal elements are identically zero (ray by ray cancellation). 

2) We can evaluate the diagonal elements easily. The integrands are: 
sin^ 0 cos^ sin^ & sin^ and cos^ 0 = fp. 

We find 

(\{E,-Pu) 0 0 \ 

P. = 0 \{E,-P,) 0 

Vo 0 Pj 

(Pu 0 0 \ . (ZP^-E^ 0 0\ 

= 0 Pp 0 ~ 9 0 ZP„-E„ 0 \ . (1.22) 

Voop^/^V 0 0 0/ 

Here Ei, is the radiation energy density and is the scalar 
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P^-—f I{x,t;n,i/)fi^dn = {—)}■ f hix^dn=~K^ . (1.23) 

c c I c 

Ku is the second angular moment of the specific intensity. The scalar Pj, 
is often called “the” radiation pressure, as if P„ were a scalar because: 1) 
in one-dimensional symmetry the operators (d/dx) and (d/dy) = 0 so only 
P33 (which equals Pzz in planar geometry, and Prr in spherical geometry) 
matters, and 2) as we shall see later, when the radiation field is isotropic 
(e.g., many mean free paths from any boundary - as in the interior of a star) 
then Pi, so the second matrix in Eq. (1-22) vanishes, and P becomes 

diagonal, isotropic, and is fully specified by a single number, P^. 

In short, in one-dimensional symmetry we need only two numbers, Ei, 
and Pi, to specify Pj,, which is diagonal. 

Isotropy. The trace of Pi, is invariant: 

P" = + Pr + P“ = ^ ^ IM +nl+ nl) du = E,. (1.24) 

Therefore we can in general define a mean pressure 




as was true for material particles. However, unlike material particles where 
the distribution function f{x^u,t) is nearly isotropic on the small scale, so 
that all diagonal elements of P very nearly equal P, for radiation the elements 
Pzz and Pxx (or Pyy) can be markedly different, especially near boundaries. 
Thus, for radiation, Pj, has no useful dynamical significance. 

But if we simply assume that I{n) is isotropic, then f Inf du = | / /do;, 
{i = 1/2,3). Therefore in this case Pi, = and 

/P. 0 0 \ 

p, = 0 Pi, 0 . (1.26) 

VO OP,/ 

This case is of great importance because it applies with very high accuracy 
in 99"‘'% of a star. Only one number is needed to specify P, in this limit. 

Streaming Limit. To study the effect of a boundary, consider the free stream- 
ing limit. Take a plane wave traveling along the outward normal, /x = 1, so 
I = IqS {fi — 1). Example: the radiation field at very large distance from a 
point source (e.g., light from a distant star). Then it is trivial to show that 
J, = Hi, = Ki,, so the pressure tensor becomes 

/O 0 0 \ /O 0 0 \ 

P^= 0 0 0 = 0 0 0 . (1.27) 

VO 0 p,/ \o 0 eJ 




166 D. Mihalas 



Variable Eddington Factors. It is useful to introduce the variable Eddington 
factor [11, 22] 

d/i/ J (1.28) 

In one-dimensional symmetry this one scalar is sufficient to specify the asym- 
metry of the radiation field. 

The variable Eddington factor provides a closure relation between and 
Ejy in the radiation moment equations discussed later. In a material gas one 
can always specify the closure relation [p/e = | (N.R.) or | (E.R.)j because 
the particle mean free path is the characteristic system size. Even near 
boundaries it can be derived from the Chapman-Enskog solution. Radiation 
theory differs fundamentally from gas dynamics: one must construct the clo- 
sure by calculating fu from a solution of the transfer equation. The reason is 
that a photon mean free path can be about the size of, or larger than, the 
characteristic system size. 

In a one dimensional medium we can expect | < /i. < 1 in most cases. In 
some cases one finds 0 < /*/ < 1; values less than | come from a “pancake” 
shaped distribution of radiation where the lateral radiation field is much 
stronger than the radial radiation field. An important example is an optically 
thin (radial direction) planar layer, which is representative of a shock. 

In two- and three-dimensional media we have, more generally, the Ed- 
dington tensor 

= PJE, . (1.29) 

Allowing for symmetry, the Eddington tensor has one, three, and six compo- 
nents in one-dimensional, two-dimensional, and three-dimensional geometry, 
respectively. Ad hoc prescriptions [25, 20] are sometimes used to estimate 
the Eddington tensor, but often satisfactory results are obtained only from 
direct calculation. 

1.9 Thermal Radiation 

Consider the important case of radiation in thermal equilibrium. Then^ 

I* = Bu(T) — — 1) (Planck function) . (1.30) 

Notice that /* is isotropic. Therefore = 0 inside the hohlraum. A nonzero 
Fi. imples an anisotropic radiation field; in a stellar interior a small temper- 
ature gradient produces the flux which transports the stellar luminosity. The 
monochromatic energy density in equilibrium is 

^ The superscript * designates quantities in thermal equilibrium. 
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j: = i: ^e; = - 1 ) . 

Then the total energy density is: 

which is Stefan's law. The integrated Planck functon is 

’ 



47T 



7T 



(1.31) 



(1.32) 



(1.33) 



where an = {a^cjA) is the Stefan-Boltzmann constant To calculate the flux 
emerging from a hohlraum, note that I* is isotropic. Thus, 



FbbM = 27 t / J5^(T)/id/i = 7tB^{T) , 
Jo 



(1.34) 



hence 



^BB = nB{T) = o-rT^ , 



(1.35) 



where BB stands for “black body”. Within the hohlraum the flux in the 
sense of (1.15) is zero. Again recalling that thermal radiation is isotropic its 
pressure is 



P* = ip* = iaRT^ . (1.36) 

TdS = d{E*V) + P*dV , (1.37) 

or 

dS = 4aRTVdT + iaaT^dy = d(iaRrV) , (1.38) 

o 

which implies 

5rad = ^aRT^V . (1.39) 



Suppose the enclosure is adiabatic. Then d5 = 0=> ^ + = 

~ constant or p*Y^/^ = constant. In fact, for pure radiation Fi = 
F 2 = F^ = 4/3. Therefore thermal radiation behaves like a polytrope with 
7 = 4/3. The specific heat at constant volume of thermal radiation is 

C„ = ^ (orTV) V = 4aRT^F . (1.40) 

In contrast Cp ^ 00 . The reason is that if P* is constant, T must be constant, 
which implies that dT = 0 even for finite dQ. 

The sound speed of pure radiation can be obtained from the general for- 
mula 
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For pure radiation | and p = |e. Therefore a = c/\/3 is the “sound 

speed” of an “acoustic disturbance” in pure thermal radiation. 



1.10 Thermodynamics of Thermal Radiation and a Perfect Gas 

It is easy to find the thermodynamic properties of a radiating fluid comprising 
both thermal radiation and a material perfect gas. Thus, 

NkT 1 

p = Pg + P* = — + -flRT^ (1.42) 

and 

e = -RT + — — . 1.43 

2 Q 

Further, TdS = d(peV) +pdV = d{^NkT + auT'^V) + (^ + |oRr^)dF 
which implies that d5 = A^fed[ln(T^/^V)] +d(|aRT^F) or 

S = Nk \n{T^/‘^V) + \ukT^V . (1.44) 

O 

Define a = P* /pg. Then one can show that 

c„ = ^ — (1 + So) , (1.45) 

z m 

Cp - ^ — (1 + 8a + ^a^) . (1-46) 

2 m 5 

One can see by inspection that these formulae go to the correct limits as 
a 0 and a — > oo. Further 



A 

A-l 

A 



Fs-l 



d\np\ 

d\ng) 


1 + 20a + 16a^ 
s (| + 12 q)(1 + a) 


(1.47) 


91np'\ 


1 1 -j- 5a -h 4a^ 


(1.48) 


ainrj 


, 1 + 20a + 16a2 ’ 


51nT^ 


, 1 -h 4a 


(1.49) 


91n ^ j 


V (| + 12a) • 



One easily sees that for all three values of the generalized adiabatic exponent 
r defined above, F — > | as a — oo and F — )> | as a — 0. With a modest 
amount of algebra these results can be extended to an ionizing material gas 
plus radiation. 

It should be noted that all of the results in this section apply for thermal 
radiation only (LTE). Thus, they hold in, say, the interior of a star, but not 
in a stellar atmosphere where A ^ or indeed in any open system which 
is out of thermodynamic equilibrium. 
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2. The Transfer Equation 

2.1 Absorption, Emission, and Scattering 

Opacity. Define the opacity or extinction coefficient x such that an element 
of material of length dZ, cross-section dS perpendicular to a beam moving in 
direction n, into solid angle do;, removes an amount of energy 

dS = dZ d5 do; dz/ dZ (2.1) 

in a time d^. The opacity x Is obtained from the sum UiUiai where is the 
number density of particle of type i which can absorb or scatter radiation at 
frequency with a cross-section The units of x ^-re [x] = cm"^. Thus, 

Ai/ = l/Xi' is a photon mean free path at frequency u. 

If the material is at rest and is homogeneous on the microscale, then x 
is isotropic. The same is true in the comoving frame of moving material. 
But in the laboratory frame the opacity of moving material is anisotropic 
because of the strong coupling of frequency to angles through the Doppler 
effect. Therefore in general x = This coupled angle-frequency 

dependence is a tremendous complication in moving media, and one is highly 
motivated to find how to work in the comoving frame of the fluid where 
material properties are isotropic. This goal is accomplished by development 
of the Lagrangian transport equation, which we shall discuss later. 

Emissivity. Define the emission coefficient r} such that an element dZ dS of 
material emits an amount of energy 

S£ = 7y(x, t; n, i/)dl dS dto di/ dt (2.2) 

into solid angle duj in the frequency interval -{- di/) in a time di. The 
units of the emissivity are 

[rj] = ergs cm“^ s“^ hz“^ sr"^ . (2.3) 

Again, the emissivity is isotropic in the rest frame of the fluid (the comoving 
frame), but in general will be anisotropic, i.e., rj = rj{x,t]n,rf)., in the lab 
frame for a moving fluid for the same reasons x Is. 

Scattering. It is sometimes useful to distinguish between “true” or thermal 
absorption- emission processes and scattering. Thermal processes couple ra- 
diation to the thermal state of the material locally. For example, suppose an 
atom is excited from its ground state to an excited state by the absorption of 
a photon, and then is de-excited back to the ground state by a superelastic 
collision with a particle in the plasma such as an electron. In this case the 
original photon is destroyed., and the photon’s energy is contributed to the 
thermal pool of the gas. Or suppose an atom is excited from its ground state 
to an excited state by an inelastic collision with a particle in the plasma, and 
then de-excited by the emission of a photon. In this case a photon is created 
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at the expense of the thermal energy of the particles in the gas. In a scatter- 
ing process a photon interacts with a scattering center, and emerges with an 
altered direction and/or frequency. In this ceise the emission rate is coupled 
to the ambient number of photons present, and not to the thermal state of 
the gas. An example is when a photon interacts with a free electron; for low 
energy photons the result is a conservative scattering process called Thom- 
son scattering^ but if the energy in the photon approaches the rest energy of 
the electron (X-rays), we have Compton scattering which admits large shifts 
in the photon energy. Another example is when an atom is excited from its 
ground state by a photon, sits briefly in the excited state, and then emits a 
photon in order to return to the ground state. 

We will adopt the notation x — where k is the absorption coefficient 
and cr is the scattering coefficient. For our purposes it will be adequate to 
consider only Thomson scattering, a = Uecro which is frequency independent. 
Then in the comoving frame^ 

Xo{x,t;i'o) = Ko{x,t;i/o) + ao{x,t) . (2.4) 

In a similar manner we can consider the emission in the comoving frame as 
consisting of thermal emission plus scattering 

rj(x,t;i/o) = + V^{x,t,uo). (2.5) 

Assume the scattering process is conservative (i.e., energy in = energy out), 
coherent {u\n = ^out)^ a,nd isotropic, in the comoving frame. Then 

SS - ^ ao{x,t)Io{x,t]no,i'o)du; = 47rcro(x,t) Jq(x, i/q) 

- 47rr?^(x,t;i/o) . (2.6) 

The last equality follows because we said the process is conservative. There- 
fore, 

7?^(x,t;i/o) = ao{x,t)Jo{x,t;uo) . (2.7) 

Thus, we see explicitly that depends on J. This fact is important because 
J at {x,t) may be determined by sources and sinks elsewhere and not at 
(x,t). Therefore t}^ is decoupled from local conditions. One can show that 
photons can scatter over large distances, through many mean free paths. 

Kirchhoff-Planck Relation. In the important case of thermal equilibrium we 
must have 

ivir = KlB.iT) . ( 2 . 8 ) 

This relationship is often extended to the idea of local thermodynamic equi- 
librium (LTE). Then, in the comoving frame 

^ The subscript 0 designates quantities measured in the comoving frame. 
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T]o{x,t;uo) = Ko(x,t;i>o)B[T{x,t),i/o] ■ (2.9) 

This result will be true when spatial gradients of material properties are 
small, i.e., properties vary on scales £ > Ap (e.g., in a stellar interior where 
I VT 1^ 10“^). The KirchhofF-Planck relation breaks down near boundary 
surfaces. In most radiation hydro problems we have little choice but to assume 
LTE (and the problem is hard even then!). 

Opacity and Emission Coefficients. Accounting for bound-bound (i j), bound- 
free (ifc), free-free (kk), and scattering processes we have {in LTE) 



Xo(x,t-,i'o) 




j>i 



-h^nenftaA;)b(z/o,T)| (1 -e + ngCTe 

K>0 + rieCTe . (2.10) 



2.2 The Equation of Transfer 

Derivation. In an element of material of length ds, the difference between 
the energy that enters one end at {x,t) and emerges from the other end at 
(x -h Ax, t -h At) results from absorptions and emissions (see Fig. 2.1). Thus, 
in Cartesian coordinates 



[I{x -h Ax,t + At\n,i/) — I{x,t]n,v)] d5da;di/di 

= [ri{x,t\n,u) — x{‘X^t\n,v) I{x,t\n,u)]ds dS du dv dt . (2.11) 

Expanding / in a Taylor’s series and noticing that At = dsjc we get 
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( X QJ. ^ \ 

l(x + Ax,t + At;n,i/) = 7(x,t;n,i/) + ( + — ) ds . (2.12) 

^ c C/i Cy s j 

Using (2.12) in (2.11) we get the transfer equation 



\cdt^ ds) 



/(x,f;n,i/) = 7j{x,t]n,u)-x{x,t;n,u) I{x,t]7i,i/) . (2.13) 



This is a purely phenomenological derivation. Approximations made here are 
discussed in Equations of Radiation Hydrodynamics by Pomraning [27, pp. 
47-49]. For a quantum electrodynamical derivation see [15, 16, 17]. 

Because s is a coordinate-independent pathlength, the above equation 
applies in arbitrary coordinate systems, provided we use an appropriate ex- 
pression in the evaluation of {d/ds). For example, in Cartesian coordinates 



ds 



dx d dy d dz d 
ds dx ^ ds dy ds dz 



d d d 



Therefore in Cartesians we have 



• v) j(a:,«;n,i/) = r]{x,t;n,u) 

~ x{x, t\ n, £/) I{x, t; n, u) . (2.15) 

In one-dimensional planar media (2.15) reduces to 

I dl dl , . . , ^ , . . 

+ -x(2:,i;^,i^)/(2;,i;/i,z/) . (2.16) 

For static media (v = 0) or steady flow, we can drop {d/dt) to obtain the 
“standard” one-dimensional transfer equation 

dl 

'^'dz ~ ~ . (2.17) 

Suppose 7] and x given. Then (2.17) is an ODE, and (2.16) is a PDE. 
Different techniques are needed to solve these two classes of equation. Suppose 
7] contains a scattering term; then on the right-hand side we have a term 
containing J = f J(^)d/i. Then the two equations just mentioned become 
(ordinary or partial) integrodifferential equationSj which are much harder to 
solve. 

In curvilinear coordinates the basis vectors rotate with respect to a 
straight-line path determined by a fixed propagation vector n. Therefore 
we must allow for changes in the components of n measured relative to those 
basis vectors. Thus, 



n • 



d „ 

— n • V + 



dn 

ds 



• V 



(2.18) 
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Fig. 2.2. Photon propagation angle in 
spherical symmetry 



For example, in general spherical coordinates we will have derivatives in the 
space coordinates 9/9r, 5/90, and and the angles that specify the direc- 
tion of the propagation vector n, 5/50 and 5/5^. In the special case of spher- 
ical symmetry, 5/50 = 5/5<^ = 5/5# = 0. Furthermore, dr = cos0ds = /ids, 
and rd0 = — sin 0ds = — (1 — /i^) 2 (see Fig. 2.2), hence 



5_ 5 sin05_ 5 (1 - /i^) 5 

ds. " 5r r 50 ^ dr r 5// ’ 

which implies the transfer equation is 



(2.19) 



15/ 5/ {l-/i^)dl , ,,,,, 

Exercise: derive the transfer equation in general spherical coordinates. 
Optical Depth. Define the optical depth as 



r„(a:,®' 




x(x + n$;n,i/)ds . 



( 2 . 21 ) 



Physically, r gives the number of photon mean free paths along the ray. In 
planar geometry it is customary to use the the vertical or radial optical depth, 
which is the number of mean free paths downward into the medium as seen 
by an outside observer. Thus, 



dvi/ — , 



( 2 . 22 ) 



or 
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Tu 




Xi^{z')dz' . 



(2.23) 



Along a ray propagating such that n • k ~ /j,, the slant optical depth is 

Source Function. The source function is defined as the ratio of emissivity to 
opacity: 



S{x,t;n, 1 /) 






(2.24) 



Then in static (or steady-flow) media we can write the transfer equation in 
the customary form 

- 5^. . (2.25) 

OT^i/ 

There are many ways of integrating this equation (and its generalizations). 
For certain special cases we can write explicit expressions for the source 
function. A few examples, in the comoving frame of the fluid are: 

Strict LTE: 



S{vo) = B{uo,T) . 

Thermal emission/absorption plus isotropic coherent scattering: 

XoM = Ko{oo) +ao{oo) , 

= k.o{j'o)B{i'o,T) + ao{oo)Jo{i'o) , 

nr s _ kq{i/q)B{uo,T) -h aoiuo)Joiuo) 

KoM+aoii^o) 



(2.26) 

(2.27) 

(2.28) 

(2.29) 



2.3 Moments of the Transfer Equation 

For simplicity work in Cartesian coordinates. The transfer equation is 



1 dl{x,t\n,u) ^ _j5/(®,(;n, I/) 
c m 



ri{x,t-,n,u) 

-X{x,t-,n,u)l{x,t-,n,u) . (2.30) 



Take the zeroth moment of (2.30) over angles, i.e., multiply by du;/47r and 
integrate over solid angles. We find 



1 dJ{x^t; u) 
c dt 



-h V • H{x,t\v) 



47T 







(2.31) 
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or, converting to physical units, we get the monochromatic radiation energy 
equation 

dE f 

+ V ■ Fu = j> [nix, t] n, I/) - x(x, t; n, v)I{x, t; n, u)] dw . (2.32) 

Now integrate over all frequencies to obtain the radiation energy equation 
dE f 

—+V-F = j Av j[nix,t-,n,v)-xix,t-,n,u)Iix,t-,n,v)]duj .{2.ZZ) 
This equation states 

Rate of change of the radiation energy density in a fixed volume 
= the rate of emission from the material 

— the rate energy is absorbed by material 

— the net fiow of radiant energy through the boundary surface 
of the volume . 

Note that Eq. (2.33) is in tensor form so we can use it in other coordinate 
systems. For example, in spherical symmetry 

^ 1 djr’^F) 

dt dr 

roo pi 

= 27t du [r;(x,t;n, i/) z/)]d/i . (2.34) 

Jq J-1 

If we define the luminosity to be L{r,t) = 47rr^F(r, t) then we can write 
(2.34) as 



dE 1 dL 
dt 47rr^ dr 



pOO p\ 

- 27T y d// y [ri{x,t\n,u) - x{^,t\n,u)I{x,t\n,u)]dii . (2.35) 

In the limiting case of a static medium we have d/dt = 0. Furthermore, in 
order to achive a steady state the material must emit exactly as much energy 
as it absorbs. Therefore 

poo 

/ in- XIHi' = 0 , (2.36) 

Jo 



which implies that V • F’ = 0. This is the condition of radiative equilibrium. 
Note that (except for highly artificial situations) radiative equilibrium can 
occur only in absolutely static media. The instant the material moves or 
evolves in time, other terms come into play. This will become clearer later 
when we study moments of the Lagrangian transfer equation. 
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Now take the first angular moment of the transfer equation. Multiply by 
nduj/A'K. We find 

1 dHt dKf/ 



c dt 



* 1 / 

^ iri- f (2-37) 



or, converting to physical units, we get the monochromatic radiation momen- 
tum equation 

1 dF^ 
dt 

Integrating over frequency we get the radiation momentum equation 



P If 

^4-V-Pj, = ~ ® [77(x,t;n,i/)-x(a^,i;n, i/)/(x,t;n,z/)]ndu; .(2.38) 



1 OF 
dt 



+ V-P 



~ \ j . (2.39) 



Recall that F/c^ equals the radiation momentum density, P is the radi- 
ation momentum flux density, and on the right-hand side one can write 
{r]lhv) X [hvlc)n ~ (number of photons) x (momentum/photon) emitted. 
And similarly for the number absorbed. Thus, 



Rate of change of the radiation momentum density 
in a fixed volume 

== the rate of momentum input to the radiation field 
from the material 

— the rate of momentum removal from the radiation field 
by the material 

— the net rate of radiant momentum through 
the boundary surface of the volume . 

Note in passing that 

/r = di/^ x(®> t\ n, v) /(x, t\ n, v)n du) (2.40) 

is the radiation force on the material, per unit volume. 

Again, Eq. (2.39) is in tensor ial form, hence it can be used in any coordi- 
nate system, e.g., in spherical symmetry: 



1 dF dP SP-E 
dt ^ dr ^ r 
27T 

= — / du [r){x,t,n,u) -x{x,t;n,u)I{x,t;n,u)]fj.d(j, . {2A1) 
^ Jo J -1 
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Static Moment Equations, In a static medium, we can take v = 0, and 
d/dt = 0. Further if we assume planar geometry, we get the standard moment 
equations used in the theory of stellar atmospheres: 

dF 

- cXvEv (2.42) 

and 



dP. ^ djfM _ 1 

dz dz c 



(2.43) 



Here we have noted that the net contribution to the momentum balance by 
isotropic emission is exactly zero. 

The Closure Problem. In solving problems of radiation hydrodynamics we 
undertake to calculate the evolution of the total (i.e., material plus radiative) 
energy and momentum density in the flow. The radiative contributions to 
these quantities follow from the radiation energy and momentum equations. 
In order to close the system of radiation momentum equations we must know 
the Eddington factor ft, (or, in general, the Eddington tensor fj, which allows 
to express Ft, in terms of Ei,. 

We must therefore be able, somehow, to calculate ft, (or the components of 
ft,). For ordinary material fluids we can either impose a closure by assuming 
isotropy of the microscopic velocity distribution (equilibrium flow) or cal- 
culate the departure from isotropy analytically (e.g., the Chapman-Enskog 
solution) because X/l 1. The same is true for radiation when Ap/^ 1 
(that is, in an opaque medium such as in the interior of a star). In this case 
we can develop the diffusion limit solution for radiation, which is similar to 
the Chapman-Enskog solution for matter. But when \p/£ > 1, i.e., near a 
boundary surface or in a very transparent medium, we must compute the clo- 
sure from a full angle (and frequency, if relevant) dependent solution of the 
equation of transfer. We shall not have time here to discuss this important 
problem other than to say that methods for making this closure exist. 

Notice that the fluid flow responds only to the total (i.e., frequency- 
integrated as opposed to monochromatic) radiative momentum and energy 
input into the material. Thus, one usually replaces the detailed frequency- 
variation of the opacity of the material with appropriate frequency averages 
over the spectrum. Again, we shall not have time here to discuss these issues. 
Suffice it to say that in the past 5-10 years very impressive advances in our 
knowledge of the opacity of stellar material have been made. 
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3. Lorentz Transformation of the Transfer Equation 

Let us now begin the study of the behavior of radiation in fluid flows^ i.e., in 
moving media. The motivation for this study is that radiation can dominate 
both the energy content and the energy transport in a radiating fluid. We 
can get an intuitive feeling for when this happens by use of a couple of 
dimensionless numbers. 

Energy Content. Define R to be 

^ energy density in the material e ^NkT 0.02SN 
energy density in the radiation E 

One can easily show that 1, so that e ^ E, when TkbV ~ where 

the density g is measured in g cm“^ and the temperature corresponding to 
1 KeV is approximately 10^ K. For example: 

Center of the Sun, g ~ 100, T ~ 10^, and R ^ 500, so that radia- 
tion is unimportant. 

Center of an 0-star, ^ ~ 3, T ~ 4 x 10^, and R 1, so that 
radiation is important 

Energy Transport. The importance of radiant energy transport is measured 
by the Boltzmann number 

gCpTv material enthalpy flux 
(7 rT^ radiation flux 

Bo is essentially {v/c) x i? as defined above. Usually {v/c) 1, therefore the 

radiation almost always dominates the energy transport (counterexample: 
convection in a star). In optically thick media one should replace ctrT^ in 
(3.2) with where Teff gives a measure of the net radiant flux in the 

medium. 

In the analysis of the radiation equations one must carefully account for 
terms that are 0(u/c). These are required for consistency between frames 
(Eulerian ^ Lagrangian). In some cases these seemingly small terms can 
dominate all others in the equations! [Remark: the frame dependence of ma- 
terial terms is 0{v^ l<?) while for radiation it is 0(u/c). Clearly the effect is 
much larger for radiation.] References: [31, 4]. 

3.1 Lorentz Transformation of the Photon 4-Momentum 

The photon 4-momentum is 

M“ = (^)(l,n), (3.3) 

where v and n are the photon’s frequency and direction of propagation in the 
laboratory frame. If we boost (Lorentz transform) to another frame moving 
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with velocity v with respect to the laboratory frame, and in particular choose 
this frame to be the comoving frame of the fluid, then applying a Lorentz 
transformation to we find: 

I/O = 71 / . (3-4) 

where 7 = 1/^1 — ](?. These four equations, when expanded to 0(t>/c) 

yield the effects described classically as “aberration” and “Doppler shift” . For 
one-dimensional motion along the z-axis, equations (3.4) and (3.5) simplify 
to: 




(i^o, I'orixo, ^oriyo, i^orizo) = [t^^(1 - n^l3),un^, vriy, - (3)] , (3.6) 



where P = v/c. Note that the photon direction in the (x,y) .plane is un- 
changed. We can rewrite the above expressions in planar or spherical geom- 
etry as: 



\uo,Ho,{l- = 



7 z/(l - /3/i), 



(M-^) .1 



(3.7) 



We can obtain the inverse transformations from the comoving frame to the 
laboratory frame by interchanging v (etc,) with i/q {etc.) and also changing 
V —V. From the above general transformations we can easily show that 



di/Q = 




dfio = 




and 



(3.8) 

(3.9) 



d#o = • 



(3.10) 



Recall that do; = d/xd#. Then from (3.8-10) we can show that the quantity 
udl/duJls^i Lorentz invariant^ i.e.f 



1/di/du = i/Q duo dcjo , (3.11) 

a result which we will use repeatedly. The invariance of u du duj is actually a 
special case, for photons, of the general result that for any kind of particle 

dV = (3,12) 

e e 

is a Lorentz invariant, where e = energy per particle. This is a standard 
result of relativistic kinetic theory. For photons, p = hu/c,e = hv\ then the 
connection between (3.11) and (3.12) is obvious. 
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3.2 Lorentz Transformation of the Specific Intensity, 
Opacity, and Emissivity 



A very simple and elegant way of obtaining the Lorentz transformation prop- 
erties of the physical quantities in the macroscopic transfer equation was 
devised by [31], who used thought-experiments based on counting photons, 
which is invariant between frames, to connect comoving-frame and lab-frame 
quantities. 

Specific Intensity, Have an observer in the lab frame count the number of 
photons passing through a surface dS stationary in that frame. The result 
will be 

N ^ dududt . (3.13) 

hu 

Now put the observer on a fluid element moving with a velocity v to the right 
in the laboratory frame, and have him again count the number of photons 
passing through the same surface dS. He now sees the surface moving to the 
left with a velocity v with respect to his frame. So he now flnds 

No = [fiodSdto + '^dSdto) . (3.14) 



In (3.14) the first term is the number counted if dS were stationary in this 
frame, and the second terms is the photon number density (7o Ihuoc) x the vol- 
ume (u d5d^) swept up by the moving surface. But because we have counted 
the same photons in both frames, N must be identical to A^o- Furthermore, 
we know that dto = ydiyU = -h 0fjiQ)yi/dudu ~ uodu^duQ,, and p = 

{po + /?)/(! + Ppo)‘ Using these relationships to connect (3.13) with (3.14) we 
get {I/i/)pdjydu;dt - {Io/i^o)di/oduo{po -\- P)jdt ~ {Io/uo)duoduop{u/v'o)dt 
which implies 



/(m. t') _ ^o(Mo.t'o) 

l>0 

Thus, 



(3.15) 



I(/x, I/) = (3.16) 

is a Lorentz invariant, known as the invariant intensity. Recalling that the 
photon distribution function /r oc we see that we could have gotten the 
same result by invoking the Lorentz invariance of the distribution function. 
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Emissivity. Have an observer in the laboratory frame count the number of 
photons emitted from a definite element of material in dV into dudu in dt. 
The result will be 

^ ^ Tjjij.,u)dududVdt ^ 

hv ' \ ■ ) 

Now have the observer count again in the comoving frame of the fluid (the 
same material). Now the result will be 



No 



T?o(i/o)da;odr'odyodto 

huo 



(3.18) 



Here we have assumed isotropy of the emission coefficient in the fluid frame. 
Now recall that dV dt is invariant, and ududu is invariant. Therefore we have 




(3.19) 



Thus, 

S=^ (3.20) 

is a Lorentz invariant, the invariant emissivity. 

Opacity, Now count the number of photons absorbed in a definite element of 
material. In the laboratory frame the result will be 



N = ‘^Bif ^i^dujdiydVdt (3.21) 

hu 

and in the comoving frame we get 

No = dcJodi/odVbdto • (3.22) 

huo 

Again demand N = No, and use the invzu-iants identified above, to get 
x(m,'^) = (^) Xo(j^o) • (3.23) 

Thus, 



A = vx 



(3.24) 



is a Lorentz invariant, the invariant absorptivity. 
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3.3 Lorentz Transformation 

of the Radiation Stress Energy Tensor 



One can guess the form of R, the full radiation stress energy tensor by 
using the same technique that works for the material stress energy tensor M. 
Thus, we can demand that the space components give back the the radiation 
pressure tensor derived before. Then 

Rij pij = f /rM'ctiM^M , (3.25) 



which equals (number of photons/cm^ /phase volume) x {ith component of 
momentum) x (jth component of velocity) integrated over all space, which 
is correct. 

Now for a photon, cn^ = (? {hun^ jc) {^Ihu) = So try^ 

pcc0 = . (3.26) 



By inspection we see that this choice has all the desired properties: It is 
manifestly a 4-tensor, being the outer product of two 4- vectors and 

times the 4-invariants /r and d^M/e. 

Now examine the other components of R: 



= j f^hud^M = (number of photons) x (energy /photon) 

= radiation energy density , (3.27) 






fYihucn^d^M = - x (radiation flux density) , 
c 



(3.28) 



= cj 

— c X (radiation momentum density in zth direction) . (3.29) 

Therefore we have exactly the right interpretations. One can, in fact, apply 
Eq. (3.26) to any particles, not just photons. In order to shift to the macro- 
scopic picture, use the identities (see Sect. 1.3) 



/r 




(3.30) 



and 



d^M = —i^duduj . (3.31) 

^ We use Greek indices to denote components in the four dimensional space time, 
while Latin indices refer to components in the three dimensional Euclidian 
space. 
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Then 



— ^ da; 7(n, u) . 


(3.32) 


This tensor is also manifestly covariant: it is composed of the invariant 
{Ifi/^)x the outer product of the photon 4-momentum with itself x the 
invariant (udi/du). Alternatively we can write 


/ B f/c\ 

"-(f/c Pj- 


(3.33) 


To express the radiation stress-energy tensor in the comoving frame, we 
merely append the subscript “0” to all radiation quantities (7, /i, i/, n, F, 
and P). To obtain the transformations laws for the components of R from one 
frame to another, boost according to the standard Lorentz transformation 
rules. In general 3D the results are rather complicated. But for ID flows, 
they are simple: 


E = -y\Eo + ^Fo+^^Po) , 
c 


(3.34) 


F = 72[(1 + /?2)Fo + vEo + vPo] , 


(3.35) 


P = j^(P^ + ^Fo+/3^Eo) . 

C 


(3.36) 


To 0(u/c), useful for many applications, 




E = Eo + —Fo , 
c 


(3.37) 


F = Fq + vEq + vPq , 


(3.38) 


P = Po + ^Fo . 


(3.39) 



As we shall see later, Eq. (3,36) has important consequences. It should be 
noted that we can apply standard Lorentz transformations only to angle- 
frequency integrated quantities, as only such quantities can truly be tensors 
on a 4 manifold Transformations for and P,^ can also be de- 

rived, but are much more complicated; they also follow directly from the 
transformation laws for I 

It is a worthwhile exercise to derive equations (3.37-39) by using the 
usual definitions of and P and first-order transformations for (/x,i^). 

For example: 



E = 



1 

c 




I da; di/ , 



(3.40) 



Idi^dtj = —udi/duj = — ^ f^ i^odz^oda; = 
1/ i' I'n 




7odi;oda;o , 



(3.41) 
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V 

therefore, 



— 1 4- /3/io ( — j — 1 + 2^/Zo , 

^0 , 



and similarly for F and P. 



20 

4 - 2y3//o)di>'odci;o = 4 Fq 

c 



(3.42) 



(3.43) 



3.4 The Radiation 4- Force Density Vector 

We expect the dynamical equations for the radiation field, as for matter, to 
have the general form 

= -G" , (3.44) 

where G^ is the radiation force density acting on the material. Thus, G^ must 
be (the net rate of radiative energy input to matter), while G* must be 
the (net rate of radiative momentum input into the matter). Therefore 



= ^ J du ^ dcj[x(n, u) - r/(n, i/)] 


(3.45) 


and 




= ^ J ^ dcj[x(n, u)I (n, i/) - r]{n, i/)] . 


(3.46) 


The above vector is manifestly covariant: it is the 4- vector u{l^n) times the 
invariants z/di/du;, 2 ind summed over phase space. 

With this choice for G" and our previous expression for (3.44) is 

indeed a covariant conservation relation for the radiation field.^ Transforma- 
tion relations for G“ are obtained by boosting. These are simple for one- 
dimensional flow: 




(3.47) 


Here, in the laboratory frame. 




O'TT 

G® = — / du dn[xifJi,i')I{fJ.,i') -rjifi,!/)] , 

c Jo J~1 


(3.48) 


27T 

G* = — / du dfj.fj.[xifi,u)I{ti,u) -ri{n,u)] . 

c Jo J -1 


(3.49) 



A relation (physical law) that preserves its mathematical form under a trans- 
formation between inertial frames is called “covariant”. 
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In this frame the double integrals are irreducible because of the anisotropy 
of the material coeflScients. However, in the comoving frame 



1 

<^o = - / [cXo(i^o).Eo(j/o) - 47r7jo(^'o)]di/o , 
c Jo 

1 

^0 “ ” / Xo(*^o)^o(*^o)d^'o • 

c Jo 



(3.50) 

(3.51) 



In the comoving frame we have single integrals because the material coeffi- 
cients are isotropic in this frame. Isotropy of photon emissions in the comoving 
frame also implies the net momentum transfer from material to radiation is 
exactly zero, hence there is no term from the emission coefficient in (3.51). 



3.5 Covariauit Form of the Transfer Equation 



Is the transfer equation covariant, or not? This question has led to much con- 
fusion in the physical and astrophysical literature. Here we shall just sketch 
some of the approaches that have been made to answering it, most of which 
have serious deficiencies, leaving a definitive analysis to the next section. 
One very basic point is to remember that the specific intensity is a seven- 
dimensional distribution function, not a 4- vector or 4-tensor. Thus, we cannot 
expect simple Lorentz transformation to apply, but must account for the 
behavior of the tangent spaces (i.e., momentum phase space) attached to 
each point of the four-dimensional spacetime manifold. 

Consider transformation from one inertial frame (say the laboratory 
frame, at rest) to another inertial (i.e., uniformly moving) frame. 

1) .We can derive the transformation properties of a 
{dlcdt.d/dx.dldy.dldz), by expanding with a chain rule: 



dt dt dV dt dz* 
dz dz dV ^ dz dz* ’ 



(3.52) 

(3.53) 



and using the Lorentz transformation to evaluate da/dt and dajdz^ where 
a = t' or z'. We obtain 



/ 1 _^ A A 

\cdV dx^ dy' dz) 



7 



a a ^a.^av 

\cdf dz^ ) ^ dx'^ dy^^^ \dz^ cdt') 



Rearranging, we find 



(3.54) 
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Id 

+ n • V = 

c at 



fu'\ 


[19 r „/l 


— 


+ n • V 




cdv 



(3.55) 



where we used the results obtained earlier for the transformation of i> and n. 
Therefore, [1 j c){dl / dt) + n • V7 = rj — xJ transforms to 




1 ^ 






\ r / £/ \ 3 

'} [(17) 



(3.56) 



Now between two uniformly moving frames the ratio is independent 

of hence it can be taken through the derivatives. Therefore we get 

-^ + n' -Vr = ri' -X'l’ ■ (3.57) 

c or 

Equation (3.57) implies that the transfer equation is covariant in the sense 
that it has the same algebraic form in all inertial frames. But in a sense this 
is a trivial result because we know from experience that we can write the 
same transfer equation in a uniformly moving medium (say a star) as in the 
laboratory frame simply by taking into account the Doppler shift Doppler 
induced by the motion of the star relative to the laboratory frame. 

2) Alternatively we could argue that the 4-gradient of the 4-invariant 
Z, should transform like a covariant 4- vector, so that the inner product 
should be an invariant. Thus, 



^ [nin, - x{n, I') I{n, = + = 

= + (3,58) 



By converting the material properties in the first expression of (3.58) to the 
comoving frame and equating the result to the last expression, we again find 






(3.59) 



Although this argument looks plausible, it misses the point: Z is a distribution 
function^ not a. scalar^ so we have no way of knowing that Z« is a covariant 
vector (in general it is not). Thus, we cannot actually claim that M“Z« is 
an invariant. Furthermore, this “derivation” doesn’t apply if the frames are 
not moving uniformly with respect to one another because then we can’t 
take the factor through derivatives. And, of course, nonuniform flow 

where v = v{x,t), i.e., there are gradients in space and accelerations in time, 
is precisely the case we wish to study. 
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So how should we proceed? Recall that the particle Boltzmann equation 



is 



df idf 





(3.60) 



Here {D/Dt) is the standard Lagrangian time derivative. The covariant gen- 
eralization of (3.60) is 



V dr J dx°‘ \ dr J 5p“ \5t J 



(3.61) 



Here {S/5t) is the intrinsic derivative with respect to time, calculated along 
the path followed in phase space. 

We can apply Eq. (3.61) to photons. For convenience use units in which 
c = /i = 1. A technical problem to be faced is that photon world lines lie on 
the null cone, so At = 0. To get around this problem we replace r with some 
other affine pathlength i. Define £ such that 




Then rewrite the Boltzmann equation as 



(3.62) 






dp°‘ 




(3.63) 



where 



^ de 



(3.64) 



For photons, / /r,p“ M“, the source term = e (emission), and the 

sink term = — o/r (absorption), so that the photon Boltzmann equation is 



M“ 



dfR 

dx°‘ 



+ 



dfR 

dM°‘ 



= e- o/r . 



(3.65) 



Or, in terms of the invariant intensity, emissivity, and absorptivity defined 
earlier. 



dl di 

M -z +M“7T7^ 

dx°‘ 



= S-M. 



(3.66) 



Equation (3.66) applies in all frames. In particular it applies in the inertial 
frame. In such a frame photons move on straight lines with constant fre- 
quency. Therefore, ignoring general relativity effects, M“ = 0. It is then we 
get 



=^S-AI, (3.67) 

which is the equation studied above in connection with Lorentz invariance of 
the transfer equation. As we have just seen, this equation is valid if and only if 
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= 0, i.e., only for uniformly moving frames. In a noninertial frame, e.g., a 
moving fluid, photons still move on geodesics^ but now the geodesics aren’t 
Euclidian straight lines, and the photon frequency isn’t constant because, as 
seen by observers attached to different fluid elements, each having its own 
v{r,t), a photon with fixed lab-frame (z/, n) has a different at each 

fluid element. 

We can write Eq. (3.66) in a form which is similar to Eq. (3.67), namely 

M^I|« = (3.68) 

where is an operator which denotes derivatives with respect to taken 
subject to the constraint that photons remain on the null cone as seen in 
the (noninertial) fluid frame. The physical content of (3.66) and (3.68) are 
identical, whereas (3.67) is quite different from either of the others. When we 
can evaluate either (3.66) or (3.68) for a specified geometry and flow, we will 
have derived the Lagrangian transfer equation. 

To prevent misunderstandings characteristic of early work, it is essential 
to bear in mind that / or J or /r are distribution functions. They exist, and 
describe the behavior of photons, not only in the four-dimensional spacetime 
{r,t) but also in attached three-dimensional tangent spaces p or (z/,n). All 
seven variables are independent, therefore when calculating the variation of 
I or /r along a path, one must remember, and differentiate with respect to, 
all the independent variables. 



4. Inertial-Frame Equations 
of Radiation Hydrodynamics 

4.1 Inertial-Frame Radiation Equations 

Let us work in the laboratory frame where the fluid moves with velocity 
v{r,t). We already wrote the correct radiation energy and momentum equa- 
tions: 



dt ax* 



= J du ^ [r]{x,t-,n,i') — xixjt'tn,!/) , 



(4.1) 



^ ^ , 

c2 dt xi 

=\jyjvn 



x,t;n,u) — x{x, t; n, u) I{x, t; n, v) ] n‘dw . 



(4.2) 
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These are just R^p = -G^ and R^p = ~'G\ So what is new here? Actually 
nothing. Then what is the problem here? The right-hand sides. They are 
messy because they contain hidden in them a host of velocity-dependent 
terms resulting from Doppler shifts. In most problems it would be essentially 
impossible to do the work (usually interpolation in tables) to account for these 
effects. Do we really need to worry about Doppler-shift-induced anisotropies 
of X ^nd r/? After all, v/c 1. The usual response in the literature has been 
to ignore the u-dependent terms and to evaluate G^ and G^ as if rj = rio{i/) 
and X = Xo{^) f^he laboratory frame frequency. However, this is a bad 
idea. To see why, we must examine the effects of 0 (t;/c) terms. Start with 
the transfer equation. 

Mixed-Frame Equations. Choose {u,n) in the lab frame. Then evaluate x 
and z/ at z/Qj where 



/ n 

1/0^ —j . 

Then, using (3.23) and (3.19) and expanding to 0{v/c) we get 

= (^) XoK) = XoH - (-^) 
and similarly 

T)(n, I/) = T]0{v) + 2rio{i') - 



XoM + 



(4.3) 



(4.4) 



(4.5) 



Hence the transfer equation is 



1 dl{n,u) 
c dt 



+ n • V/(n, I/) = ■noil') - Xo(i')Iin, v) 



+ 




2%(i^) - i'^ + 



Xo{i') + i'^ 




(4.6) 



This is a mixed-frame transfer equation: /, z/, and n are measured in the 
laboratory frame, whereas xo and t)q are measured in the fluid frame, which is 
advantageous because these quantities are isotropic in that frame. Integrating 
over angle, we get the monochromatic radiation energy equation 



dE, dFi 

dt dx^ 



= iwnoii') - cxoME^ + 




Xo(i') + V 



dxo 

dv 



(4.7) 



and the monochromatic radiation momentum equation 



1 dF^ dPt^ 
(? dt dx^ 



1 

-Xo{i')Fl + ^^ 
c c 



Xoii') + i'^ 



+ 




(4.8) 
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where we used the result that §n^n^d(jj = ( 47 t/ 3 )( 5 ^-^. Integrating over fre- 
quency we get the radiation energy equation 



dE dF^ 
dt ^ dx^ 



f 



[4nT]o{iy) - cxo(i')E,,]di> 



■f 

-ccr 



V. 

+ — 
c 



Xo{u) + u— 



F!, du 



(4.9) 



Note that the absorption-emission term on the right-hand side is not -cG^: 
both i/ and E,y are in the laboratory frame. Similarly, taking the first angular 
moment we get the radiation momentum equation: 



1 dF^ 

(? dt ^ dx^ 



1 

= — / 
C Jo 



.'1 r 

C Jo 



Xo{i^) + 1' 
= -cG^ . 



dxo 

dv 



47TV* 

P'-'di/H 5 - / r]o{u)dv 

c Jo 



(4.10) 



Again note that the first term on the right-hand side is not —cG^: both 1 / 
and Fu are in the laboratory frame. 

On the Importance of {v/c)Terms, What are the above equations trying to 
tell us? To extract the physical flavor, let us make a simple analysis for grey 
material. Thus drop the terms in dxo/du and d7]o/du. We then have 



dE ^ 
dt dx^ 



= k(47tP - cE) + -ViF^ 
c 



and 



1 dF^ dP^^ 
(? dt ^ dx^ 



K 

c 



-F* -f VjP^^ -h V 




(4.11) 



(4.12) 



Consider the energy equation first: 



1 ) We instantly see that if we drop the 0{v/c) term v^Fjc^ we have omit- 
ted a term equal to the rate of work done by the radiation on the material, 
which can be quite large when the radiation field is very strong. Clearly this 
is not a good choice if we want to do radiation hydrodynamics. 

2) Next, examine the diffusion regime. Prom the Lorentz transformation 

cE = cE^ -h 2v - F^/c -> inB -h 2v • F/c -h 0{v‘^/c^) . (4.13) 



Using this result in Eq. (4.11) we have 
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Equation (4.14) makes the physically correct statement: (the rate of change 
of the radiation energy density in a fixed volume) plus (the rate of work done 
by the radiation on the material) equals the net rate of radiant energy inflow 
into the volume. We thus reach two important conclusions: (a) omission of 
0[v/c) terms leads to an error of the order of the net absorption-emission 
terms, and (b) we get the correct first law of thermodynamics for radiation 
only if we keep the 0(t;/c) terms. 

3) Prom a simple dimensional analysis one can show that {k,v • F/c) is 
0(^v/Apc) 1 relative to V • P, Hence in the limit of dynamic diffusion, 

(^/c) > (Ap/^), the velocity-dependent terms may actually dominate the 
energy balance. 

Now consider the momentum equation. Examine the diffusion regime. In 
that limit one can drop the term c~^{dFldt) because by dimensional analysis 
one finds that it is 0(Apu/^c) 1 in comparison with KFjc. Thus we get 





(4.15) 


In the diffusion limit 






(4.16) 




(4.17) 


Fo-^--V-Po. 

K 


(4.18) 


Thus (4.15) simply states 






(4.19) 



which is the standard Lorentz transformation of the flux, simplified to 0(u/c). 
Therefore, unless we keep 0(i;/c) terms in the transfer equation we fail to 
discriminate F from Pq- Is this important? Consider the diffusion limit: 
there we have {vEo/F) {vaT"^ / c) / {aT^^) (t;/c)(T/Teff)'^. Recall that 

in the Sun, the ratio of the central temperature to the effective temperature 
(Tc/Teff) ~ 10^. So even if {vjc) is only 10“^^, the advected flux can exceed 
the diffusive flux. (Note: this problem doesn’t arise in the comoving frame.) 
Thus it is clear that in the laboratory frame formulation it is important to 
analyze (v/c) terms. 
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Dimensional Analysis. 

Rules: 

1. (d/dx) -> {!/£) 

2. (d/dt) ^ l/{i/v) = {v/£) 

3. We can drop any term that is always at most 0(u/c) in comparison 
with the dominant term. 

4. But we must keep in all regimes any term found to be comparable 
to the dominant term in any regime. (Otherwise we won’t get the 
transition from one regime to another right.) 

5. Grey material. 

Regimes: 

1. Streaming limit: Xp/£ > l;E ^ P;F cE 

2. Diffusion limit: Xp/£ 1;P ^ ~E 
3.) Static diffusion: v/c<^ Xp/£ 



F Fq, therefore, 




^0 




E 



h) Dynamic diffusion: u/c > Ap/^ 



F vEq, therefore 



k{cE - 47rJ3) ^ -^{vE) ~ O E 
ax \£J 

Begin with an analysis of the transfer equation following the rules stated 
above. 





A 


B 


C 


D 




c di 


+ n ■ V/ 




+ (=^)(2r? + ; 


Streaming: 


{v/c) 


1 


(^/Ap) 


(t;/c)(^/Ap) 


Static diffusion: 


[vie) 


1 


(Ap/^) 


(w/c)(^/Ap) 


Dynamic diffusion: 


{vie) 


1 


{v/c) 


{v/c){i/\p) 



Thus in the streaming limit, we need only the spatial gradient term B and the 
source-sink term C, which yields the standard static transfer equation. Term 
A is needed only in a radiation front. For static diffusion we need terms B and 
C, and drop A. Normally we can also drop term D. But if (Ap/^) is so small 
that {v/c) > (Ap/^)^, then the velocity- dependent term may become the 
dominant source-sink term on the right-hand side. In the dynamic diffusion 
regime we need terms B and D. 
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In summary, in order to guarantee a correct solution of the mixed-frame 
transfer equation we must retain the spatial operator on the left-hand side, 
and all terms on the right-hand side. 

Next, consider the energy equation. 



ABC D 

If +V-F =k{A'kB-cE) 



Streaming: (^/c) 1 

Static diffusion: {v/c){i/Xp) 1 

Dynamic diffusion: 1 1 



(i/Xp) iv/c){i/Xp) 

1 {v/c){e/Xp) 

1 {v/c){e/Xp) 



In the streaming limit we normally need terms B and C, but can drop the 
(d/dt) term A and the velocity-dependent term D. We need to include term A 
if we try to follow a radiation front. For static diffusion we normally need only 
terms B and C and can drop terms A and D. But note that as (v/c) (Ap/^), 

all terms are of the same order and all must be retained. In the limit of 
dynamic diffusion, we need all terms, and in fact the velocity-dependent 
terms may dominate all others. 

In summary, to guarantee the correct solution of the mixed-frame energy 
equation in all regimes we must retain all terms. 

Finally, consider the momentum equation. 



ABC D E 

+V-P +(^)(4^„)+(|)(„.p) 



Streaming: 1 

Static diffusion: (^^/c)(Ap/^) 1 

Dynamic diffusion: 1 



(^/Ap) {v/c){e/X^) {v/c){i/x^) 

1 {v/c){e/Xp) (i;/c)(£/Ap) 

(u/c)(£/Ap) {v/c){e/Xp) {vlc){tlXp) 



In the streaming limit we must retain the basic terms B and C. We can drop 
the time-dependent term A compared to B unless we wish to track a radiation 
front. We can drop the velocity-dependent terms D and E in comparison with 
C. In the static diffusion limit we can normally drop the time- and velocity- 
dependent terms A, D, and E in comparison with B and C. However, note 
that if v/c becomes comparable to Ap/^, then we must retain all terms on 
the right hand side. In the dynamic diffusion limit we can drop the time 
derivative term A, but we must keep all terms on the right-hand side. 

In summary, in solving the mixed-frame radiation momentum equation, 
we can guarantee the correct solution in all regimes only if we retain all terms, 
except d/dt (which will also be needed if we follow radiation fronts). Thus, 
in the mixed frame there axe no simplifications, no short-cuts. Basically all 
the terms in the equations must be kept. We will find a different result when 
we work with Lagrangian (comoving-frame) equations. 
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4.2 Inertial- Frame Equations of Radiation Hydrodynamics 

The equations of radiation hydrodynamics can be written in covariant form 
as 

Radiation: 

= -G“ . (4.20) 

Matter: 



view 1: Mf = , (4.21) 

view 2: (M“^ + = F“ . (4.22) 

Here is the material stress-energy tensor, Jf?“^ is the radiation stress- 
energy tensor, is an external 4-force, and is the radiation-matter 4- 
force. Equations (4.21) and (4.22) are based on two different, but equivalent, 
views. In (4.21) we have an external force (e.g., electromagnetic, gravita- 
tional, etc.) plus a radiation force acting on the material. In (4.22) we have 
an external body force acting on a radiating fluid. In view of (4.20), (4.21) 
and (4.22) are mathematically identical; nevertheless “view 1” is intuitively 
more natural in the streaming limit, whereas “view 2” is more natural in the 
diffusion limit. 

Recalling the L.H. Thomas expressions for and combining them with 
the radiation terms, we get the relativistically correct equations: 



(piC^ - p),t + (QiC^V^)j = VjP + cG° 


(4.23) 


and 




-b ~ fi~ P,i + Gi , 


(4.24) 


or 




[gic^ -p + E)^t + [qic^v^ + - vjP 


(4.25) 


and 




(g^Vi + c-^F'),t + (giviv^ + = /' - F^Pj . 


(4.26) 


Here, using Thomas’ notation, qi == 7^^>ooOj where ^>ooo — (l + ^ - To 

cancel the large rest mass terms from the energy equations above, subtract 
times the continuity equation from them, to obtain: 



[(7 - + (7^ - l)p],t 

+ {[(7 - 1)qc^ + 7 pe + 7 ^p]t)‘},i = ViP + cG° (4-27) 



or 
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[(7 - 1)qc^ + jge + {■f - l)p + Elt 

+ {[(7 “ + 7 ^e + 7 ^p]v* + F*}.j = Vif' . (4-28) 



In (4.27) and (4.28) the term (7 — \)q(? is the kinetic energy of the fluid 
and the term is its internal energy. Let us now reduce these equations 
to 0(u/c) to derive equations that can be applied in Newtonian radiation 
hydrodynamics. 



Momentum Equation. Thus defining = 7 ^ 000 j we can manipulate (4,24) 
and (4.26) into the relativistically correct momentum equation: 



Dv . ^ 

e*-pT = f ~^P 

Dt c 






+ G--G° 
c 



(4.29) 



or 



Dv , _ V f dp 

fldF„\ V fdE „ \ 



(4.30) 



On the right-hand sides of (4.29) and (4.30), the first term is the externally 
imposed force, the second term is the force from a pressure gradient, the third 
term is the radiation force, and the fourth term accounts for the change in 
the equivalent mass density of the material resulting from its interaction with 
radiation (remember: energy mass ^ inertia => dynamical effect). 

The velocity-dependent terms in equations (4.27-30) are logically essen- 
tial, for they guarantee consistent equations spanning the optically thick/ thin 
transition, and between the laboratory frame and the Lagrangian frame. 
Therefore, in discussion of such points one must retain all of them. But for 
practical applications in Newtonian flow (small v/c) we can omit any terms 
that are at most 0(u/c) in all regimes. It is easy to show that {v/c)G^ is at 
most 0(v/c) compared to G; therefore we can drop it. Likewise, in (4.30) one 
can show: 





A 


B 


c 


D 




_ 1 OF 
^ dt 


-V-P 


+^(f) 


+^(V-F) 


Streaming: 


(vie) 


1 


{v/cf 


{vie) 


Static diffusion: 


(u/c)(Ap/^) 


1 


{vie? 


(u/c)(Ap/£) 


Dynamic diffusion: 


{v/cf 


1 


[v/c? 


{vie? 



One sees that all the radiative terms are at most 0{v/c) compared to V * P, 
so that only P need be retained for practical work in the Newtonian limit. 
Thus the Newtonian momentum equation is 
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Dv 



Q 



Dt 



or 



f-Vp + G, 



(4.31) 



Dv 



Q 



Dt 



= / - Vp - V • P . 



(4.32) 



These are just the standard Newtonian momentum equations for gasdynam- 
ics in the presence of a radiation force or, equivalently, a divergence of the 
radiation pressure. What is new here? At first sight, nothing. But recall that 
when we evaluate G (which is not Gq) or V • P (which is not V • Pq) we must 
carry along 0(t>/c) terms, especially in the diffusion limit We see here, and 
will see again later, it is the radiation terms, not the material terms, that 
cause trouble! 



Total Energy Equation. Let 7 1, so that 7 — 1 — )> /c^). Then the total 

energy equations (4.27) and (4.28) become 



or 



^ge + +1 Q^e+ +p = ViP + cG° , (4.33) 

+1 e^e + lu^^+p v* + F*| (4.34) 



Note that the kinetic energy ^gv‘^ is really a relativistic effect arising from 
boosting the rest mass energy goc"^ into the laboratory frame as the fluid 
moves. Equation (4.33) is the standard Newtonian energy equation with the 
addition of the net radiative energy input into the fluid as a source term 
on the right-hand side. Likewise Eq. (4.34) is the standard Newtonian energy 
equation for the radiating fluid, written to account for the total energy density 
(material -f radiation) and the total energy flux (material -{- radiation). Is 
there anything new here? Yes: the caveat that when evaluating the radiative 
terms one must account for 0{v/c) terms. Note in particular that the energy 
exchange term on the right-hand side is not Gq. Note in passing that 
(4.33) can be written in quasi-Lagrangian form 

g^{e+^Qv'^) + V ■ {pv) =v f + G^ . (4.35) 

Mechanical Energy Equation. If we take the inner product of v with the 
momentum equation and expand to 0{v/c) we have 



or 



Q 



D 

Dt 




= -V ■ (Vp + V • P) + u • / . 



(4.37) 




4. Inert ial-Prame Equations of Radiation Hydrodynamics 197 



These again are standard Newtonian equations which we could have written 
intuitively {provided^ of course, we remember to account for the velocity- 
dependent terms in G and P. They state that the rate of change of the 
kinetic energy equals (a) the rate of work done by the gas pressure gradient 
plus the rate of work done by the total (external plus radiation) force, or, 
as in (4.37), the rate of work done by the total (gas plus radiation) pressure 
gradient plus the rate of work done by external body forces. 

Gas Energy Equation. Following L.H. Thomas again, one can reduce (4,24) 
to the relativistically correct gas energy equation 



Qo 



[Dt Veo/J 



- VaG^^ . 



(4.38) 



For ordinary body forces, VaF^ = 0. If thermonuclear reactions occur, then 
in the comoving frame -V^F^ — qqE and -VaG^ = 7(cG^a - v • G) = cGq. 
Equation (4.38) is clearly just the covariant generalization of the first law of 
thermodynamics. It states that: the rate of change of material energy plus 
the rate of work done by material equal the rate of energy input into material 
from radiation and thermonuclear sources, both measured in the comoving 
frame. Now to 0(v/c), Qo 3nd 7 -4 1. We then obtain 



Q 



De D 

p — 

Dt ^Dt 




or 



QS + C G^ — V ' G — QS C Gq , 



Q 



De D 

{- p — 

Dt ^Dt 




dE 

= ge-V‘F- — +v^{V^P) 



(4.39) 



(4.40) 



To see what terms on the right hand side of (3.39) need to be retained for 
practical applications, one can perform a dimensional analysis. One finds that 
the ratio (i; • G/cG^) is < 1 in the static diffusion limit, > 1 in the dynamic 
diffusion limit, and > 1 in the streaming limit if the material is nearly in 
radiative equilibrium. Therefore we should retain all terms on the right hand 
side. One comes to similar conclusions for (4.40); in particular all terms are 
of the same order in dynamic diffusion. We can rewrite (4.40) as 



Q 



' D 
Dt 






-1- V • (F - vE) = v ‘V P ^ QE . 



(4.41) 



Note that 



W . V • P = WiPy = - P'jvij = V • (w • P) - P : Vw , (4.42) 

hence 



Q 



D 

Dt 





+ P :Vv 



+ V • (F — vE — vP) = ge. 



(4.43) 
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Recall F — vE ~ V ■ P — Fq, the fluid-frame flux. Also, P : Vv is the 
rate of work done by the (anisotropic) pressure tensor. Equation (4.43) is a 
quasi-Lagrangian first law of thermodynamics for the radiating fluid, with 
Fo playing the role of an equivalent heat-flux vector. We will rederive this 
result from the Lagrangian view later. 

Synopsis. We have shown that for practical Newtonian calculations the dy- 
namical equations for the radiating fluid are just what we would have ex- 
pected intuitively from purely Newtonian arguments by adding an appro- 
priate radiation counterpart for each material term in the equation. Most 
previous formulations have penetrated this far. However it must be remem- 
bered that the radiation equations from which the radiation terms in the 
fluid equations are to be obtained contain terms, formally of 0(u/c), which 
are essential to keep. Without these terms one fails to discriminate between, 
say, the lab frame flux F and the comoving frame flux Fq (purely diffusive). 
It is here that most previous formulations have gone astray, and potentially 
serious errors were made by ignoring u-dependent terms in solving for the 
radiation field, and by evaluating quantities like and as if they were 
Gg and 

The mixed-frame equations have some significant disadvantages: 

1. We would need opacity and emissivity derivatives [d/du) in order to 
evaluate the right-hand side of the transfer equation or its moments. It is 
almost impossible to obtain this information for a complex spectrum. 

2. One may encounter problems with the centering of the 0[vjc) terms 
in the difference equations. Yet these terms can be crucial (e.g., in discrimi- 
nating between F and Fo). 

We shall see that these difficulties are mitigated in the comoving-frame ap- 
proach. 
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5. Comoving-Frame Equation of Transfer 

References: [4, 21, 23, 24, 3]. 

Our goal in this section is to obtain a transfer equation (and its moments) 
in which all radiation and material properties (/, f/, E, F, P, x, 77 ) axe ex- 

pressed in the comoving fluid frame (the proper frame) . The complication to 
be faced is that the comoving frame is noninertial; photons move on geodesics 
but these aren’t Euclidian straight lines; both the direction and frequency of 
a photon as measured by comoving observers change along its path. 

In our discussion it is important to distinguish the frame in which the 
physical variables {dependent variables) are measured, and the coordinate 
grid {independent variables) used to specify locations in spacetime. We want 
the former to be the comoving frame, but the latter can be any grid. For 
example: 

Lagrangian grid (Castor): grid fixed in the fluid. 

Inertial grid (Mihalas, Buchler): grid fixed in the lab. 

Adaptive grid (Winkler): grid that tracks features in 

the flow, and floats with 
respect to both fluid and lab. 

I will call all such formulations “Lagrangian” or “comoving frame” because 
that is where the physics is being done, even though only Castor’s work is 
100% Lagrangian in the narrowest sense. 

The choice of coordinates is important. Castor’s derivation (historically 
the first) is somewhat harder to follow than the others because it employs 
a full GR formalism using concepts from differenticJ geometry (even though 
the physics requires only SR). To get a start into the problem, let’s work first 
with lab-frame coordinates. 



5.1 Special Relativistic Derivation (D. Mihalas) 

Choose an inertial spacetime; the laboratory coordinates will do. Proceed ex- 
actly as one does in deriving relativistic material fluid equations, i.e., leave the 
coordinate system in the lab frame but boost all fluid (radiation) quantities 
from the lab frame to the proper frame. Assume ID spherically-symmetric 
flow. In the lab frame we know that 



l dl{n,i>) dl{n, v) 
c dt ^ dr 






= vil^, ‘') - X(M. I') 



(5.1) 



Recall that 
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hence 







( 7 ) »<■'»> 







(5.2) 



I dIo{fJ.o,i^o) dIo{fJ.o,i'o) 9Io{fJ.o,i^oy 

c dt ^ dr r dfi 



-31^ 



1^ ^ 

c dt ^ dr r dfi \ 



IoifJ'0,1'0) 



= VoM - Xo(t'o)/o(Mo,i^o) • 



(5.3) 



We must now express all quantities in the coefficients in the comoving frame. 
Likewise we convert all derivatives with respect to lab frame variables to 
derivatives with respect to comoving frame variables by use of the chain rule. 
These conversions account for the fact that all angles and frequencies are in 
the comoving frame. For example, in (5.3), in the lab frame 



’ 

that is, a partial derivative with respect to r is taken, holding t, //, and v 
fixed. What we want is to shift the derivative to fixed {fJ^o^^o)- Thus, 



d 
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1 


d(J.Q 


d 


1 


duo 


d 


dr 


tfLJ/ 


dr 


-r 


dr 


tfti> 


-r 


dr 


di'o 


d 
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i 

1 


duo 
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1 


dvo 


d 


dt 


rflu 


dt 


' 

W^QVQ 


dt 




“T 


dt 


dt'O 


d 








dfio 


d 


1 


duo 


d 


dfjt 


rtv 






dfj, 


rtp 


~T 


dfj. 


rtu dl'O 



Now use the relations (from (3.7)) 

2 _ 1 - 

ni-Pny 



^0 

Mo 



71/(1 - dn) 

fi- p 



1 - Pn 

to derive (after some algebra) 



(5.7) 

(5.8) 



(5.4) 

(5.5) 

(5.6) 



(5.9) 
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It = 

^ (5,11) 

^ ( 512 ) 



dvo 2 90 

^ = -7 (5.13) 

^ == 7^(l+/?/io)^ (5.14) 

^ = -I3j^il + 0fio)i/o (5.15) 



which are in the desired form because all physical quantities are in the co- 
moving frame. 

Using these expressions in the chain rules (5.4-6), and then using those 
expressions in the transfer equation we find, after a moderate amount of 
algebra, the comoving frame transfer equation: 



1(1 + + 7(« + 

c at or 






— ( 
dfio I 



7(1 -M^) 






-^(1 + /3Mo)^ 



loil^o, *^0)1 



di>o 



71/0 






“I Mo(l + 



^ /o ( mo, I'd)! 



+ 7 



2/xo + ^(3 — Mo) 



+ 7^(1 + Mo + 2/3/io)-^ 



+ \ [2Mo + ^(1 + Mo)] ^o(mo, I'o) 

= r]o{i'o) - Xo(i"o)1o(Mo,i'o) • (5.16) 

A few remarks about (5.16) are in order: 



- This transfer equation is relativistically correct; it is accurate to all orders 
of {v/c) {0 < (3 < 1). 

- The left-hand side contains all the mess (most of which was missed in 
earlier attempts to derive). But the material terms on the right-hand side 
are simple, and are the quantities we actually know (can compute from 
first principles). 

- The left-hand side is written in conservation form for angles and frequen- 
cies. 

“ The left-hand side can be made conservative in space too, even on an 
adaptive grid. 

“ The 0(u/c) equations will be less formidable. 
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Now integrate over angle to obtain the (relativistic) monochromatic radiation 
energy equation: 



^dEoii^o) V dFo{i/o) 



dt 



dt 



+ 7 



dFo{uo) dEo{uo) 
+ V 



dr 



dr 



I i [2Fo{i/o) + SvEoii'o) - vPo(i'o)] 
dv 



+ 7 






dr 



2v 

Eo{j^o) + Poi^o) H — T^o(^^o) 



dv 1 

^ 47r7?o(i^o) ~ cxo(i^o)^o(i^o) • 



(5.17) 



If we integrate the transfer equation against we obtain the (relativistic) 

monochromatic radiation momentum equation: 



7 


'dFoiuo) 


, dPo{vo)' 




'dPo{vo) V afo(^'o) 


c2 


dt 


+ ” at 


+ 7 


dr (? dr 



Here 



d 

di/Q 






7 ^ dv 



+ 7 






[Fo^i^o) — Qo(^o)] [Qo + '^Po{i^o)] 



2v 

^Poi^o) - Eo{uo) + -rFo{iyo) 

' 7 ^ dv 

c2 ^ [2i^o(^^o) + ^^£^o(t^o) + ^i^o(i^o)] 



7^ 9t? 
^ dt 



2v 

Eo{i^o) + Po{i^o) + ~2 ^o(^o) 



= --Xo(i^o)^o(*^o) • 
c 



QoM = 27t fillo{fM),i'o)dno ■ 



(5.18) 



(5.19) 



Note that these two equations contain four moments {Eo,Fq,Po,Qo), not 
three. Therefore to solve them we need two closure factors: / relating Po to 
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£^ 0 , and g relating Qq to £o* In fact, in general Qo is a three-index tensor 
Qijk] this tensor has four independent components in 2D, and ten in 3D. 

Finally, integrate over frequency. We get the (relativistic) radiation energy 
equation: 



7 



fdEo , dFo\ , fdFo , dEo\ 



+ 7 



- (2Fo + 3vEo - vPo) + 7‘ 
r 



> dv / 
dr \ 



Eq Pq -\ — —Fc 






7 ^ dv 



+ (2£o + vEo + vPo) 



rOO 

= [47r7?o(i^o) - cxo(i'o)£^o(i'o)]dj/o 

Jo 

and the (relativistic) radiation momentum equation: 



(5.20) 



7 /aFo aPo^ (dPo V dFo\ ri ^ 2v^\ 



/y2 Qy ^2 

+ ^ a; + ? 

1 

= — / Xo(^^o)£o(i^o) di/Q • 

c Jo 



^ ( 

dt V 



^ _ 2t; _ 

Eq + Pq H — tPq 



(5.21) 



These two equations contain only tensorial quantities £, £, and P. So we can 
check our derivation by starting with the laboratory-frame equations 

and 



^ 3P-£ 

dt ^ dr ^ r 



= -cG^ 



(5.23) 



and boosting E, F, P, and G into the comoving frame. After some algebra 
we find linear combinations of the comoving radiation energy and momen- 
tum equations written above. So it checks. The approach we took has the 
advantage of giving the monochromatic energy and momentum equations, as 
well as the full transfer equation. These allow us to determine the full angle- 
frequency variation of the radiation field. They have in fact been solved in 
some highly relativistic demonstration problems and in realistic supernova- 
envelope models. 

For many purposes we need equations accurate only to 0{v/c), E we 
expand (5.16) to first order in we get the Lagrangian transfer equation 
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1 D/o(// 0 ,l'o) Mo d r 2 t / 



De 






di'o 



I'O 



(1-^2)” +!fo|: + if22 

cr c or c 



+ 



cr 



c dr 
= Voi^o) - Xo(i^o)^oifJ'Oi^o) • 

A few things about (5.24) should be noted: 



^o(/^0 5^o)| 



(5.24) 



~ a = (dv/dt) — fluid acceleration 

- Aside from terms in a (normally negligible) this equation agrees with the 
Lagrangian equation first derived by Castor. 

- This equation looks possible to solve. In fact the full relativistic version 
has been solved, but only for steady flow [23]. 

- We get the planar limit by letting (1/r) -)* 0. 



If we integrate over da;o we get the monochromatic radiation energy equation 



D£?o(^o) , ^ ^ [ 2 rr> { \'\ 

- ^ (■'0 - Poinj)! + ^Po + ^Fo(w»)|^ 

o o 

H — [3£?o(^o) “ -Po(^o)] + [^o(^^o) -h ^o(^o)] H — ^-^o(^o) 

r or c^ 

- 47rr}o{iyo) - cxo(i^o)Eoii'o) . (5.25) 

If we integrate over /xod^o we get the monochromatic radiation momentum 
equation 

1 DFo(i/q) dPoiiyp) SPoit/p) — £/q(z^o) 

dr r 

r ) ^ ^ [^o(t'o) + -PoK)] 

= --Xo(i^o)Po(i'o) • (5.26) 

c 

Finally, by integrating over up we get the radiation energy equation 
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^ ^ (3^;o - Po) -H g + Po) + 



1 d 



dv 



2a 



Dt dr 



dr 



(? 



rOO 

= / [47rr7o(i'o) - cxo(i^o)Po(i'o)]di/o 

Jo 

and the radiation momentum equation 



(5.27) 



1 DFq dPo 3Pq - Pq 2 

Dt dr r (? 




Po + 4 (^0 + Po) 



oo 

Xo(i^o)^b(i^o)cli/o . (5.28) 

As we shall see, these equations have a direct physical interpretation. Further, 
it is clear that they can be incoporated into a practical computational scheme. 




5.2 Consistency Between Comoving-Frame 
and Inertial-Frame Equations 



Consider a grey, planar, pure-absorbing medium in LTE. Then the comoving- 
frame energy and momentum equations (omitting acceleration terms) sim- 
plify to 



9Eo dFo dE^ 




(Po + Po) = ko(47tJ3o - cEo) 



(5.29) 



1 dFo ^dPo dFo ^ 2 fdv\ Ko p 

dt dz dz \dz) ° c ” 

Start from the inertial-frame energy equation (4.11) 

^ ^ = «o (47 tPo - cE) + ^koF 

and use the 0{vfc) transformations for E and F (3.37-38) to get 



(5.30) 



(5.31) 



dEo uEq 

-a 87 + '’ 



dz dz 



^ \9z) 



v^o -r jtqj 



= /to (47rPo - cPo) - -kqFq -f 0{v^ !<?) 
c 

Now rearrange terms and use (5.30) to get 



(5.32) 



= Ko (47rBo - cPo) + O(uVc^) (5-33) 
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which is identical to (5.29). Many similar consistency checks are possible; they 
all work out. The upshot of the analysis is that we get consistency between 
the inertial-frame and comoving frame equations if, and only if, we retain 
the 0{v/c) terms in both frames. 

5.3 Noninertial Frame Derivation (J.I. Castor) 

We now work directly in the accelerated frame of a Lagrangian fluid element. 
The algebra is rather heavy, but the deepened conceptual basis makes the 
additional effort worthwhile. We can sketch only the main points here. 

Start from the Boltzmann equation for photons: 



=£-AI = 

dx^ dM^ 



(5.34) 



Recalling that photon trajectories are geodesics even in the curved spacetime 
of the comoving frame, we can write, using Christoffel symbols, 



- 0 = 






(5.35) 



But M" = dx“/d^, and = dM“/d^, so (5.35) can be rewritten 



M“ = - 






Using (5.36) in the photon Boltzmann equation we have 
D2 



(5.36) 



where 



x“ dx°‘ 1 o: ^ J 



d 

dM'T ■ 



(5.38) 



To evaluate the Christoffel symbols we need the metric of our Lagrangian 
spacetime. We return to this point later. 

For purposes of calculation it is more convenient to not use contravariant 
components M'*', but rather components measured in an orthonormal tetrad, 
which can be chosen to exploit any symmetries in the problem. Thus we use 



M“ = e“M“ , 



(5.39) 



where e“ is an appropriate transformation matrix. After some algebra one 
can show that the transfer equation in the tetrad frame is 



ra — M“ ( d - I — £ - A1 

' Dx“ U 



(5.40) 
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where 




is called the Pfaffian derivative^ and the connection coefficients 



-a 



t 



P 

c;a 



(5.41) 

(5.42) 



are the Ricci rotation coefficients. If we axe given the metric we can evaluate 
the e’s and the T’s; note that the T’s are not symmetric in lower indices 
(Christoffel symbols are). In practice only three tetrad components of M are 
independent because M is null. Thus, we will use only space components in 
the tetrad frame. 

Lindquist [21] considers the general metric 

d«2 = + e^^dU^ + R^{d0^ + sin^ 0d<t>'^) , (5.43) 



where 7?. is a generalized Lagrangian radial coordinate, and r is “time” as 
measured in the Lagrangian frame. Now impose spherical symmetry; then 
d/d9 — d/d(t) = 0, and we need calculate only terms in 'll and r. 

This calculation is not difficult, but it is laborious. One finds 16 nonzero 
Christoffel symbols for the metric in (5.43). The metric also implies e" and 
its inverse Given the Ps and the Christoffel symbols, one can calculate 
the Ps\ after a lot of algebra one gets ten nonzero values. Next, choosing 
tetrad components: 



II 

o 


(5.44) 


= V cos 9 — v\i 


(5.45) 


— z/sin0cos^ = z/(l — cos# 


(5.46) 


= 1 / sin 0 sin # = z/(l - sin # 


(5.47) 



we can find the Jacobian J ( ^ — ) and its inverse J I • 

V z/, 0, # J J 

We can then expand 



d 



dv d dfi d d 



dM^ dM^ du 



dM^ dfjL ' dM^ 9# ^ dv^ v d\i 

and similarly for 9/9M^ and d/dM^. Putting all the pieces together, with 
a large amount of algebra we can evaluate M"DI/Dx", obtaining finally 
Lindquist transfer equation 



d I ~ d 



(5.48) 



DrT -f /xDt^X — u 

+ (l-M^) 



MDK<? + M"D^yl + (l-/i")| 



du 
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where 



D. 


_ ->P 9 


(5.50-a) 


III 

o 


(5.50-c) 


Dt? 


1 

CD 

111 


(5.50-b) 


U = DrR. 


(5.50-d) 



So all we need is a metric! 

Castor adopted the following coordinate transformation from the inertial- 
frame coordinates to Lagrangian coordinates (^): 

Mr{r^t') — f 47rr^^g{r\t')dr (5.51) 

Jo 

and 

= — ~ [ v{r'yt')dr' (5.52) 

Jo 

which repesent an approximate Lorentz transformation locally in the neigh- 
borhood of (r, C). It is then easy to show that 



dx = 



/ dMr \ 
\47rr^^ / 



= dr - vdt* , 



(5.53-a) 



d^ = 




(5.53-b) 



where 



Jo dV 



(5,54) 



and we used the equation of continuity to show that {dMrfdt')r = — 47rr^^u. 
Inverting equations (5.53) we get 



dr = 



' 1 - 

D 



dx + 




dt' = 




where 

D = l- \{I + v'^) . 



(5.55) 

(5.56) 

(5.57) 



Now plug equations (5.55) and (5.56) into the inertial-frame metric 



= dr^ -I- r^(d6^ + sin^0d(j>^) — c^(dt')^ 



(5.58) 



to get 
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ds^ = F 




+ {dB^ + sin^0d(/>^) - Gdt^ - 2HdMrdt , 



(5.59) 



where 



F = 



1 




(5.60) 



G = (c^ - t;2)/£)2 , (5.61) 

H = vl j [A-nr^ Q(? D^) . (5.62) 



One can easily show that H is 0{v^ /(?), hence it can be dropped. Similarly 
F -»• 1 + 0(v^ /(?) and G + 2/ + 0{v^ !<?). 

We now have if", yl, and 7^ in the Lindquist metric. Therefore we can 
plug these quantities into equations (5.50) and (5.49), and after considerable 
amount of algebra we get the comoving frame transfer equation 



1 D/o(/Lto,I'o) , . ^ r 2 r C M 

+ 47r^Mo^irrF A)(mo, t'o)] 



Dt 



d 



duo { 

duo \ 



(1-/^^) 



'dMr 

1 + ^ 

r c 



/ 3u D In ^ 



cr c ut 



+ 



(1 _ 3^^)— - (l + A*o)Dl”g 2^00 

cr ■ 



c Dt 
= VoM - Xo(t'o)Jo(/io,i^o) • 



b(Ato,i^o)| 
1 o(mo,^'o)| 
/o(mo,i^o) 



(5.63) 



The following remarks are appropriate: 



1. Equation (5.63) is a truly Lagrangian equation. Note the presence of 
d/dMr and D\ng/Dt. 

2. Of course Eq. (5.16) can be cast into this form by an appropriate grouping 
of terms. Indeed, using the equation of continuity, D In g/Dt + [dv/dr) -f 
(2u/r) = 0, it is easy to show that Eq. (5.63) is identical to (5.16). 

3. Taking angle moments we get the same radiation energy and momentum 
equations as before. 



For future reference, the radiation energy equation becomes 



D 



Eo 

Q 



+ A'Kg- 



d 



(r^Fo) 



/Dine 
V Dt 



= f 



dM, 

[4nr)oM - cxo(»^o)Fo(i/o)]di/o 






(3Fo - Fo) + ^Fo 



(5.64) 
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and the radiation momentum equation becomes 



+ -V. 



c2 Di V 



a 1 

“I — ^ (-^0 + Po) = — / Xo(^o)Po(i^o) • 
c Jo 



(5.65) 



General Geometry, From a different 0{v/c) analysis Buchler showed that 
the radiation energy and momentum equations can be rewritten in general 
tensorial form: 

^ ^ ^ ^ ^ ^ (5.66) 



^ D / Fq\ 
Dt \ g ) 



+ V.Po + -r(Fo- V)t; 



+ ^ {Eoa + a * Po) + Go — 0 . 



(5.67) 



These equations are useful for working in other geometries {e,g., cylindrical, 
3D Cartesian). Note that the monochromatic forms and the full transfer equa- 
tion in Buchler’s paper contain numerous typos (and some alleged errors); 
more complete and accurate formulae have been published by [26] . 



5.4 Analysis of 0{v/c) Terms 

We can play the same dimensional analysis game with these equations as 
we did with the inertial-frame equations. Consider first the energy equation 
(5.66); as before consider fluid-flow timescales only. Ignore the acceleration 
terms which are never larger than 0{v/c), 







Streaming: 


Static diffusion: 


Dynamic diffusion 


A 


{vie) 


(u/c)(£/Ap) 


1 


B 


+ ^i:{r^Fo) 


1 


1 


{elv){Xpl£) 


C 


-^(3Po-Eo) 


{v/c) 


(t;/c)2 


(n/c)(Ap/£) 


D 


- (^) Po 


{vie) 


(v/c)(^/Ap) 


1 


E 


— {Attt} — cxE) 


(^/Ap) 


1 


1 



In the streaming limit we need to retain the flux divergence B and the source- 
sink term E. We would need the time derivative term A if we tried to track 
radiation fronts. Also, if the medium is very nearly in radiative equilibrium, 
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term E ^ 0, and both DEq/DI and the velocity-dependent terms may be 
needed to balance (or drive) V ♦ F. In static diffusion we need only the 
divergence term B and the absorption-emission term E so long as (u/c) < 
(Ap/^). As (v/c) (Ap/^) we need to retain all terms except term C, (3Po ~ 

Eo)An practice we would keep C anyway so that in spherical geometry we can 
identify all terms in the radiation work term Pq : Vv. In dynamic diffusion 
we need the BEo/Dt term A, the work term D, and the absorption-emission 
term E. 

In short, to guarantee a correct solution of the comoving frame radi- 
ation energy equation in all regimes, we must retain all time- and velocity- 
dependent terms. Thus we reach the same conclusion as we did in the intertial 
frame (though the actual terms are, of course, quite different). 

Similarly for the momentum equation (5.67) 

ABC D E 

Q D f I dPo I SPq—Eq Fq av Xo-^0 

^ Dt y Q J dr ^ r c 

Streaming: (i;/c) 1 1 (tj/c) (^/Ap) 

Diffusion regime: (i>/c)(Ap/£) 1 {vlc){\pli) (^^/c)(Ap/£) 1 



In the streaming regime we need terms B and C (which together give V • Pq), 
and the radiation force E. In the diffusion regime we also need only these 
terms. Thus to guarantee the correct solution of the radiation momentum 
equation in all regimes we need retain only 



dPo 3Pq ““ Eq 
dr r 




(5.68) 



This result is much simpler than the inertial-frame equation. If we want to 
follow radiation flow, we would also need to retain the term in DF/Dt. Castor 
arrived at the same conclusions for the radiation energy and momentum 
equations by considering stellar pulsation with a frequency u (which sets the 
characteristic value of D/Dt). 



6. Lagrangian Equations of Radiation Hydrodynamics 

In this section we shall derive radiation hydrodynamics equations in the co- 
moving frame, ignoring acceleration terms, which are 0{v^ j(?)^ on a fluid-flow 
timescale. 



6.1 Momentum Equation 

If we reduce the relativistically correct equation, derived earlier (4.29), to the 
comoving frame (in which u = 0 instantaneously), we get to 0(u/c), 
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^000^ — / - V/? 4- Go . (6.1) 

Here we ignored the 0{v^ jc^) difference between the laboratory frame time t 
and proper time r. If the fluid is not relativistic internally (i.e., at extremely 
high temperature or degenerate), we can set ^ooo Q‘ We then get (for ID 
spherical flow) 






Q 



Dt 



GMrO 

j,2 



dp 1 
or c 



I 



oo 

Xo{^o)Fo{uo) diyo . 



( 6 . 2 ) 



This equation is just what one would guess intuitively from Newtonian ar- 
guments; but note the specification of a particular (comoving) frame for the 
radiative term. 

We can rewrite Eq. (6.2) by using the radiation momentum equation (5.65) 
and the continuity equation. We obtain 




GMrQ _ d{p -h Pq) ^ 3Pq - Eq _ 1 ^ 

dr r dr ^ 



(6.3) 



This equation states: 



The rate of change of the total momentum density 
of the radiating fluid 

— the external force 

— the divergence of the total pressure (p I -h Pq) in the 
radiating fluid 

— a relativistic term arising form the inertia for radiant energy . 

On a fluid-flow timescale all terms in Fq are at most 0(v/c), therefore we 
may drop them in nonrelativistic practical calculations. We obtain: 

g^ = f-Vp-VPo. (6.4) 

This result is again what one would guess from purely Newtonian arguments. 



6.2 Gas Energy Equation 



We have seen (in Sect. 4.2) that the relativistic gas energy equation is 



^0 



De D / 1 V 
Dr Uoj. 



-y„(F"+G“) . 



(6.5) 



In the comoving frame the right-hand side reduces to c{Fq+Gq). For ordinary 
body forces F§ = 0; if thermonuclear reactions occur, VaF^ — qq€ (e.g., in 
stellar interiors or in a laser- fusion pellet). Then 
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We can now drop 0{v^ !<?) terms, hence Q, D/Dr D/Dt, obtaining 

*D6 D /1\1 roo 

^ Dt (,ejj ^ /o 

Clearly Eq. (6.7) is the first law of thermodynamics for the material. 



6.3 First Law of Thermodynamics for the Radiation Field 

To obtain the desired result, we can simply rewrite the Lagrangian radiation 
energy equation (5.64) as 



rOO 2 Q 

= J [47r7jo(r'o) - cxo(^'o)£;o(t'o)]di/o - , (6.8) 



The rate of change of the radiation energy density 
per unit volume 

d- the rate of work done by radiation stress (P : Vv) 

= the net rate of “heat delivery” to the radiation field 
(i.e., energy emitted by gas — energy absorbed by gas) 

— the divergence of the radiant “heat flux” out of the volume . 

Equation (6.8) is just what one would expect for a nonviscous, heat-conduc- 
ting fluid. 



6.4 First Law of Thermodynamics for the Radiating Fluid 

To obtain the first law of thermodynamics for the radiating fluid simply add 
the first law of thermodynamics for the material (6.7) to the first law of 
thermodynamics for the radiation (6.8). We find 



D 

m 






= £ 



dMr 



i^nr^Fo) , 



(6.9) 
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The rate of change of total radiating fluid energy density 
per unit volume 

-(- the rate of work done by total stress in radiating fluid 

— the rate of energy input from irreversible processes 

- the divergence of the total “heat flux” out of the volume 
(material conduction ignored here) . 



In the diffusion limit, 3Po = Eq high order terms, therefore we can simplify 
(6.9) to 






e + — 1 + (p + Po) 



D 

m 






e - 



dLr 

~dM~r 



( 6 . 10 ) 



which is one of the standard equations of time-dependent stellar structure. 
Because the nuclear timescale (of the order of 10^ years) is so much greater 
than a characteristic dynamical time of a star (of the order of a few days at 
most), in following stellar evolution we can go to the quasistatic limit, hence 



dLr _ 
dMr ~ ^ ’ 



( 6 . 11 ) 



which is one of the standard equations of stellar structure and (quasistatic) 
evolution. 



6.5 Mechanical Energy Equation 



If we multiply the momentum equation (6.2) by v, we get 



D(|^ _ GMrQV 



Dt 



— V 



dp 

dr 






c Jo 



Xo{i^o)Fo{uo) duo 



The rate of change of the mechanical energy density 
= the rate of work done by gravity 
-h the rate of work done by the pressure gradient 
-j- the rate of work done by radiation forces . 



(6.12) 



6.6 Total Energy Equation 

Using the equation of continuity, and the relation V • (pv) = pV • u -f -u * Vp, 
we can rewrite the mechanical energy equation as 







GMr 



d 



dMr 

D 



(47rr^up) 



= P 



Dt 



- I + 



— Xo{i^o)EoMdiyo > 

Jo 



(6.13) 
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Eliminating the radiation force on the right-hand side by using the radiation 
momentum equation (5.65) we get 











(6.14) 



Now add the first law of thermodynamics for the radiating fluid (6.9) to 
obtain the total energy equation of the radiating fluid: 



Df 



' ^^ 0^1 2 
e+- + -v 
K Q 2 






(6.15) 



The rate of change of the total energy density in a fluid element 
-i- the divergence of radiant energy flux 
-h the divergence of the rate of work done by stresses in the 
radiating fluid 

= the rate of energy production by irreversible processes . 

This equation is a direct analog of the total energy equation for a heat- 
conducting, but inviscid, ordinary material fluid. (Compare!) The energy 
density and stress now have both material and radiation components. In 
the diflFusion limit we get the same equation as is used in dynamical stel- 
lar structure calculations. If we now integrate (6.15) over a volume contain- 
ing a star, apply the standard boundary conditions of stellar structure work 
{T 0,p 0, as r R), and note that T>/Dt and d/dMr commute, we get 



The rate of change of the internal, radiant, and kinetic energy 
in the entire star 

-h the rate radiant energy emerges from stellar surface 
= the total rate of release of gravitational potential energy 
-f the rate of energy production by thermonuclear energy release . 

We can express (6.15) in Eulerian coordinates: 



dt 



ge + Eo + 



GMrQ 



+ V 



+ Po +P+ e (e+ - ^)] I) 



= ge . 



(6.17) 
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But Fo + vEq + vPq ~ F, and Eq = E 0(v/c). Therefore 



dt 



ge + E -QV^ 



GMrQ ^ 



j__a 

J .2 




F +V 



p-\- gle-\- ~v 




(6.18) 



6.7 Consistency of Different Forms of the Radiating-Fluid 
Energy and Momentum Equations 



Let us now examine the consistency of various forms of the radiating fluid 
energy equation. In the optically thin limit the “natural” form of this equation 
is the gas energy equation. 



,De D /'I 
I Di \Q 



-f 



[cXo(*^o)i^o(*^o) - 4-iTTio{i'o)]<ii^o + , (6.19) 



which expresses energy gains and losses of the gas directly. In the diffusion 
limit, the right-hand side of (6.19) approaches zero by cancellation to many 
digits. It is then natural to use the first law of thermodynamics of a radiating 
fluid: 





- - (3Po - Eo) 



( 6 . 20 ) 



In practice we must choose only one of these equations in a real computation 
even though the flow spans both optically thick and thin regimes. We can 
guarantee (analytically) a correct transition from one regime to the other if 
and only if 



1. all of the D/Dt and t;-dependent terms are retained in whichever of (6.19) 
or (6.20) are chosen, and 

2. all of the D/D< and u-dependent terms we decided were necessary are 
retained in the radiation energy equation. 

If we do not satisfy these requirements, then disaster follows. Castor shows 
that dropping the 0{v/c) terms causes an error oi 0{P/p) in the regime for 
which the energy equation chosen is not well suited. This error may be ac- 
ceptable if P/p 1, but will be a serious error if P/p 1 (high temperature 
material, interior of a star, etc.). 

The situation for the momentum equation is totally different. Here we can 
drop DFo/Dt and velocity-dependent terms multiplying Fq because all are 
at most 0(u/c). Moreover these terms become 0{v^ /(?) in the mechanical 
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energy equation. Therefore we can ignore all 0{v/c) terms in the radiation 
momentum equation without affecting consistency among various forms of 
the energy or momentum equations. 



6.8 Consistency of Inert ial-Prame and Comoving-Prame 
Radiation Energy and Momentum Equations 



We have already shown that we must keep 0{v\ terms to assure consistency 
of inertial-frame and comoving-frame radiation energy and momentum equa- 
tions. How about the radiating fluid equations? 

Start from the inertial-frame energy equation (4.40): 



De D 




9E „ 

- + V.f\+v 



Convert this equation to the fluid frame to 0(u/c): 

“ On a fluid-flow timescale c~^{dF/dt) is 0{v^ !<?) compared to V -F. There- 
fore drop it. 

- We don't need 0(u/c) terms in converting E Eq and P Pq because 
they produce 0{v^ /(?) terms, 

- We must retain all terms in F = Fq + vEq -h v • Pq- 

- In spherical geometry: 



V F = \^[r'^{Fo+vEo+vPo)] 



dFp 2Fq 

1 1 - V 

or r 



dEo dPo 



dr ^ dr 



^ + y) (Eo + Po) 



— p 9Pq (3Po - Eo) 

V • ro — “5 1 

or r 



Using these results in (6.21) we obtain: 



Q 



[De D 

h P — 

Dt ^Dt 




DEq 1 a(r"Fo) 

Dt r^ dr 



H — (3Fq — Pq) + (Fo “I" Pq) 
r 



dv 

dr 



( 6 . 22 ) 



Note that the right-hand side of (6.22) is identical to the left-hand side of the 
radiation energy equation (5.27). Replace it by the right-hand side of that 
equation. We then get an exact reduction to 

"De D /l\l poo 
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the comoving frame gas energy equation (6.7). 

Alternatively we can start with the inertial-frame equation (4.39) for grey 
material, 



De 

Dt 






(6.24) 



But from the 0{v/c) version of the Lorentz transformation, E = Eq + 2v 
Fo/c^y and v ^ F = v • Fq 0{v'^ /c^). Thus (6.24) reduces to 



Q 



FDe D 

h P 

Dt ^Dt 




= Kq{cEo- 47tBo) 



(6.25) 



again, the comoving-frame gas energy equation for grey material (6.7). 
Now consider the inertial-frame momentum equation (4.30) 



Du 



= /- Vp- 



^ dF „ \ V (dE „ 



(6.26) 



- We can ignore 0[vjc) terms in converting F Fq; they produce 0{v^ /(?) 
terms. 

- We must keep 0{vlc) terms in the transformations E — Eq (2u/c^)Fq 
and P = Pq + {2v/c^)Fq. 

- We can drop {v/c^){dE/dt) which is 0{v^ /(?) compared to VP. 

- In spherical geometry: 



^ ^ dP (3F-F) 
V-P- — + 

or r 



and 



V F = 



dF 2F 
dr r 



Using these results in (6.26) we obtain: 
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(6.27) 



The radiative terms on the right-hand side of (6.27) are identical to those 
on the left-hand side of the comoving-frame radiation momentum equation 
(5.28). Therefore replace them with the left-hand side of that equation. We 
obtain 

Du GMrQ dp 1 

^ ^ Jo 

which is the correct comoving-frame momentum equation (6.2). One can 
perform many other checks; all of them show consistency, so we can trust our 
equations. 
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7. Radiation Diffusion 

Having derived the equations of radiation hydrodynamics, how do we solve 
them? For a preliminary orientation, it is instructive to count the number 
of variables and the number of equations available to determine them. For 
the material, the variables that appear in the equations are v,e,p, and T. 
There are a total of seven variables (the velocity has three components). To 
determine these quantities we have the equation of continuity, three com- 
ponents of the momentum equation, the gas energy equation, a mechanical 
equation of state p = p(^,T), and a caloric equation of state e = e(^,T). We 
thus have the same number of equations as we have unknown variables. The 
system is complete. In fact, we can ignore T and p if, as is true for, say, a 
poly trope, we know p = p(^,e). 

For radiation we have to determine the scalar E, the vector F, and the 
symmetric tensor P, which in 3D comprise one, three, and six components 
respectively, for a total of ten. To determine these variables we have the 
radiation energy equation plus three components of the radiation momentum 
equation. Thus we are short by six equations] The missing information must 
be supplied by a closure relation. A convenient closure relation is provided 
by the Eddington tensor, f = P/E, which, however, is not known a priori, 
and must be computed in an auxiliary calculation. In ID we need only one 
component; in 2D we need three; and in 3D we need six. In ID the calculations 
of the Eddington factor is relatively easy. In 3D it can be quite difficult, and 
costly. 

The simplest possible case is diffusion. Here we know a priori that down 
the diagonal / = |, and all off diagonal elements are zero. So let’s consider 
this case first. 

In general we will solve radiation hydro problems numerically on a discrete 
grid. We will discuss this procedure later; here we make only a few simple 
remarks. Typically the physical variables are centered in space on a staggered 
mesh as shown in Fig. (7.1). The centering in time can be leapfrog if we use 
an explicit method, or we can synchronize all variables at a given time level 
if we use an implicit method. The basic idea is that if we are given initial 
conditions we can construct the solution at a sequence of advanced times by 
means of a suitable numerical marching procedure. We do not require further 
information about the procedure now; we will treat it in more detail later. 
For now we will deal with physical issues. 

7.1 Radiation Diffusion 

The basic assumptions of all diffusion theories is that the test point is far 
from boundaries (optical depth r ^ 1), and that the particle mean free path 
is very small compared to a typical structural length (Ap < ^). In the case 
of radiation, there are several “flavors” of diffusion. 
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Fig. 7.1. (a) Centring of physiczd variables on Lagrangian mesh, (b) Spacetime 
centring of physical variables on Lagrangian mesh 



- Equilibrium diffusion assumes that the radiation temperature is identical 
to the material temperature so that E = 3P = aRT"^. 

- Equilibrium diffusion applies strictly only for material very far from bound- 
aries or regions with steep temperature gradients. 

- In equilibrium diffusion it is always assumed that the radiation has a 
Planckian spectrum. Thus we can always replace a nongrey opacity with 
the Rosseland mean, and we never need multigroup (multiple frequency 
groups) diffusion. 

- A second-order diffusion theory can be worked out for equilibrium diffusion. 
It allows us to develop a theory for radiation viscosity. 

- In nonequilibrium diffusion we allow the radiation temperature Tradiation 
to differ from the material temperature Tmateriai- Then E = 3P = <irT^ 
where Tr / T. Thus this theory can give decent results even near a bound- 
ary surface. It is also reasonable to use for, say, the penetration of radiation 
from a hot source into cold material. 

- In nonequilibrium diffusion the multigroup problem makes sense because 
the spectrum of radiation is not constrained to be Planckian, and, in fact, 
the spectral distribution of the radiation field is one of the things we want 
to determine. 
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Ordering of Terms. In static media we saw that in the diffusion limit all 
radiation properties are specified by local fluid properties and their gradients. 
There is only one significant parameter in the problem: (Ap/^). In moving 
media the situation is more complicated. Here we have two small pgurameters, 
(Ap/^) and (v/c), which are independent. We must now be careful in sorting 
terms in order to get a valid asymptotic theory: 

- Zero order: drop all terms 0(Ap/^), 0(t;/c) or higher. 

- First order: retain 0(Ap/^), 0(v/c). Work in comoving frame to find that 
Oiv/c) terms are absent. (The photon horizon contracts to a point; the 
observer is unaware he is moving). 

- order: retain terms 0(-f ^). Such terms are the next dominant terms 
after 1st order. 

- Second order: (a) retain terms of 0(v^/c^). These terms are contained in 
the covariant expression for R which can be worked out to all orders in 
(v/c); or (b) retain terms of 0(Ap/^^). This expression is not used at all 
because it gives poor results in ordinary gas dynamics. 

~ In view of the above, we whall refer to the “l|” order expression as “2nd 
order” . 

One final remark: all of the arguments we shall use below are heuristic. A 
rigorous asymptotic theory is both possible and needed; this work would 
make a good Ph.D. thesis. 

Thermalization. In static media we need require only r 1, i.e., (Ap/^) 1. 

In moving media we should demand in addition that a photon be destroyed 
in the same thermal environment as it was created. Thus we must assure that 

U = — « 1 • (7.1) 

C V t c 

This condition is almost always assured whenever diffusion is appropriate; 
certainly it is true in the stellar interior. For scattering, replace Ap by A, the 
thermalization length. 

Zero-Order ^^Diffusion”. For grey matter, the radiation energy equation 
(5.64) becomes in the equilibrium diffusion limit 




and the radiation momentum equation (5.65) becomes 



c ' Q D f Fo\ IdEo Fodv' 
Xo V ^ ^ 3 ^ dr 



(7.2) 

(7.3) 
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Consider material that is so homogeneous that we can ignore all gradients 
within an interaction volume. Formally, let (Ap/£) -)* 0. Then the right-hand 
sides of both (7.2) and (7.3) vanish and we have Eq (^tt/c) (t^o/Xo) = 

(47t/c)jBo and Fq 0 Moreover, if there are no gradients, then the radiation 
field is isotropic, hence Pq — Thus in the comoving frame, the radiation 
stress-energy tensor is: 



Ro 



f 0 \ 

V 0 



(7.4) 



Thus in this limit radiation behaves like an ideal fluid: it has an isotropic 
pressure, and there are no transport effects. Therefore there is no radiation 
flux. 



1st Order Diffusion. Let us now look for terms that are 0(Ap/^) or 0{vlc). 
Start with the radiation momentum equation 



Q D / Fo\ Fodv dPo 
Dt V ^ dr dr 

A B CD 




(7.5) 



Prom a dimensional analysis one can infer that B/A and C/A are both 
0{Xpv/ic). Therefore we can drop these terms for now. Thus in equilibrium 
diffusion, Fo = -(c/xo)V • Po = -(c/3xo)'^Eo = -(aRc/3xo)VT^ 
or 



Fo = -FrVT , 



(7.6) 



where the radiative conductivity is 



^ 4aRcT^ 
3X0 



(7.7) 



Equations (7.6) and (7.7) are the same as derived earlier for static diffusion. 
Now we see that they also apply in the comoving frame of the moving ma- 
terial. Note the strong dependence of /Cr on T. It is made even stronger by 
the variation of xo in the stellar interior. This sensitivity of Kr to 

T will turn out to be important when we study thermal waves. 

For nongrey material a similar analysis yields 



^o(i^o) = - 



47T 



^Xo(^o) 



VB{uo,T) = - 



47t 9B(i/o,T) 

dT 



VT 



(7.8) 



We then find that to obtain the correct integrated flux we should use the 
Rosseland mean xr (7.6) for Kr instead of xo- Note in passing that Fq 
is, in fact, 0(Ap/^) • Eq. Eq is the dominant term on the diagonal of Rq- 




Xoc 








Dt 




D 



A 



B 



C 



(7.9) 
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Then from dimensional analysis we can infer: 

Thus the departure of Eq from AnBofc is at most Xpv/£c (a result we used 
earlier) . 

Thus a consistent first-order expression for the radiation stress^nergy 
tensor in the comoving frame of a moving fluid in the equilibrium diffusion 
limit is 



' anT^ 



There are interesting contrasts 
a material fluid: 



-3^V(a.r-, \ 







) 






anT* 


-—VT ) 






VT 

c 


c 


(7.10) 




l«ar*l J 


between this 


stress-energy tensor and that of 



— For material, Chapman-Enskog theory gives both viscous and heat-conduc- 
ting terms to 1st order in {Xp/£), 

— But for radiation we get only “heat conduction” . 

- We will see later that radiative viscosity is The physical rea- 

son radiation viscosity is so small is that for a material particle (mo- 
mentum/energy) ^ {mv)/{mv‘^) 0 ( 1 / 1 ;), while for a photon, it is 

{hi/ / c) / (hi/) 0(l/c). Therefore photons are much less efficient at trans- 

ferring momentum per unit energy transport. 

- Note that no 0(i;/c) terms appear in Rq in the comoving firame. 

Equations of Radiation Hydrodynamics in 1st Order Diffusion Limit. In the 
equilibrium diffusion limit Rq can be expressed in terms of T and VT. No 
new information beyond that used in an ordinary material fluid calculation 
is required. Using Rq in the general dynamical equations = /“, we find 
the momentum equation 

and the radiating fluid energy equation 
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These are the standard equations used in dynamical stellar structure calcu- 
lations. 

Let us now sketch briefly how the computation proceeds (in spherical 
symmetry). Suppose at P we know g, v, e, p, and T. Then we can compute 
the acceleration from the momentum equation. We use this acceleration 

to update the velocity: ^ ^ + Atv^- We then use the new velocity 

to find the cell interface positions at -f . The inter- 

face positions imply volumes, hence densities, of mass shells. We must then 
solve the energy equation at Assuming e(^, T) and p{q^T) are known 
(e.g., from tables), we have a nonlinear equation for The highest-order 

differential operator in this equation is V * V, hence in discrete form the equa- 
tion is tridiagonal To solve it, we linearize it with respect to T, to derive an 
equation of the form: 









We then solve (7.13) for the corrections ST to the current estimate of the 
temperature distribution. We apply the corrections, and iterate the procedure 
to convergence. Then we proceed with the next time-step. This algorithm is 
basically very simple and cheap, hence the popularity of this approach. 

Second- Order Equilibrium Diffusion. Now let’s go after terms that are 
0{Xpv/£c), which are only linear in each of the two small parameters, and 
therefore are likely the next largest terms. The new physics that emerges 
concerns radiative viscosity. This problem has been studied by Jeans, Milne, 
and others; the best work of that era was by L. H. Thomas [31]. Thomas 
started with the mixed-frame transfer equation and developed a solution for 
I by iteration. Given this expression for /, one can evaluate R by integration 
over LJ and u. This calculation is straightforward but tedious. It is impressive 
that Thomas was able to derive all these results and cast them into covari- 
ant form by inspection. A less cumbersome and physically more appealing 
approach was developed by [30]. After a considerable amount of algebra he 
was able to derive covariant expressions for the radiation energy density £, 
the radiative energy flux T, and the radiation pressure as well as the ra- 
diative conductivity A"r, which is the same as given in (7,7). The coefficient 
of radiative viscosity (a world scalar) is 



Mr = 



4 

15 




(7.14) 



which can be derived to within a numerical factor of 0(1) by simple mean free 
path arguments. From these quantities we can derive a complete covariant 
radiation stress energy tensor. To gain further physical insight one can reduce 
the relativistically correct results to the comoving frame and retaining terms 
to 0{Xpv/ic) to get 




7. Radiation Diffusion 225 





(7.15) 


iFo)i = -KK(r,i+^a,^ , 


(7.16) 




(7.17) 



where is the traceless rate of strain tensor, 

and 

P = iaRT*-5MR(2^) , 

and 

Cr = • (7.20) 

- The radiant flux contains an acceleration term which is identical with 
Eckart’s relativistic expression for the conductive energy flux in a material 
fluid. 

- From (7.20) it appears that radiation has a substantial bulk viscosity (Cr). 
[33] showed that this is an artifact that arises because Thomas and Simon 
use the material temperature T, not the thermal-equilibrium temperature 
T\ 

- Using the radiative viscosity in place of the material viscosity we can define 
a radiative Reynolds number 



(7.18) 

(7.19) 



material kinetic energy density 
radiation energy density 



Ren can be quite small in high-temperature flows. At first sight this result 
might imply that radiative viscosity might be important. However, from 
a dimensional analysis one can show that the radiative viscosity terms 
(radiative dissipation function) are only 0(u^/c^) relative to the dominant 
terms in the energy equation. Thus it appears that in the radiating fluid 
energy equation we can neglect energy dissipation by radiative viscosity. 

“ In the momentum equation for a radiating fluid the radiation viscosity 
terms are 0{\pvlic) and thus should be retained. One can show that these 
terms could be important in some cases, particularly when Ap/£ ^ 1. But 
of course diffusion theory is not valid then, and we must make an analysis 
based on full (comoving frame) transport theory. 

— Taking the full comoving-frame radiation energy equation to the diffusion 
limit we recover the results of 2nd-order diffusion theory result exactly. Fur- 
ther, one sees that the acceleration term in the comoving-frame equation 
comes from the inertia of the radiant flux. 
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“ The comoving-frame radiation momentum equation in the diffusion limit 
reduces exactly to the 2nd-order diffusion theory form. One can show that 
the acceleration terms in the radiation momentum equation account for 
the inertia of radiation enthalpy, while terms like c~^F ■ Vv account for 
the interaction of the radiation momentum density with shear flow. 

- Virtually all existing (Newtonian) discussions (e.g., by [7, 13, 14, 18, 19]) 
of radiative viscosity in a radiating fluid are inconsistent; they omit terms 
of the same order as are retained. 

- Analytical 2nd-order radiation diffusion theory, while giving important re- 

sults of principle, isn’t too useful because the new terms are negligibly 
small when \p/£ 1, where the theory is valid, and become significant 

only in the transport regime where Xp/i > 1, where diffusion theory breaks 
down. 

- The obvious thing to do is to use the full comoving-frame radiation en- 
ergy and momentum equations [e.g., 0{v/c) equations from [4]] because 
(a) these are valid in the transport regime and give a correct description of 
the nonlocal energy and momentum exchange between radiation and ma- 
terial when Xp/£:$> 1, while (b) limiting to the correct form in the diffusion 
limit. 



7.2 Nonequilibrium Diffusion 

Diffusion theory as discussed thus far is very restrictive because it demands 
strict thermal equilibrium between the radiation and the material, which 
implies the radiation field is Planckian at Tr = Tmateriai- We can develop 
a much more powerful and useful theory by assuming Xp/i ^ 1 (i.e., the 
material is opaque) but that the radiation is not in equilibrium with the 
matter. Some examples of physical situations of this type are: 

- Suppose the radiation field is imposed from the outside with Trad ^ 
Tmateriai, 2is in Stellar radiation falling on an interstellar nebula. 

- Suppose fluid dynamics drives the material properties (e.g., T^) faster than 
the matter can relax radiatively (e.g., as in a strong shock). 

“ Suppose the radiation field changes (for a variety of possible reasons) faster 
than the material can follow instantaneously. 

The advantages of nonequilibrium theory are: 

— The radiation energy density E can be quite different from t^R^materiai • 

— The spectrum of the radiation field can be non-Planckian. 

We will again formulate the theory in the comoving frame. By exactly the 
same arguments used for equilibrium diffusion we again find: 
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But now we do not replace Eo{uq) with (Anfc)B{uQ,T), Next, examine the 
Lagrangian monochromatic radiation energy equation (5.25). Using (7.22) for 
Fq{uo), and requiring isotropy (because departures from isotropy are higher 
than 1st order), we get: 






= Ko(i^o) [47 tB(i/o,T) - cEoil'o)] + V • 



3xo(i^o) 



VEoK) . (7.23) 



Equations (7.22) and (7.23) are the monochromatic nonequilibrium diffusion 
equations. 

Now integrate over uq; note that the (d/duo) term vanishes. We obtain 
the radiation energy equation (nonequilibrium diffusion equation) 



3^”Dt UJ. 

(^ VEo) + c (/cporT'* — keEo) 



= V- 



(7.24) 



and the integrated flux 

3x 



(7.25) 



Here /cp is the Planck mean, and ke is the absorption mean defined as 



poo j poo 

Kp = y KQ{yo)B{vo,T)duQ j J B{i'o,T)di/o , 

ke= f Ko{uo)- ° - ^°- duo = f Ko(Po)e(i/o)di/o , 

Jo ^0 Jo 



(7.26) 



(7.27) 



where e(i/o) is the spectral profile of the radiation energy density. Likewise, 
X is defined such that 



1 1 
— V£?o = / — 7 — Eo{i^o)duo 

X Jo Xoi^o) 



(7.28) 



(using different values along different coordinate axes if necessary). It is often 
convenient to introduce a radiation temperature Tr: 



= Eo- 



(7.29) 



Note that Tr is only a parameter which is related to the total radiation 
energy density Eq. In nonequilibrium diffusion, Eq{uo) is not Planckian at 
Tr\ 
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When solving the equations of radiation hydrodynamics using nonequi- 
librium diffusion, we must determine two variables at time-level namely 
T and Tr. So now we solve the gas energy equation 



Q 



FDe D 
D< 




= orc (keT^ - KpT‘‘) + ge 



(7.30) 



and the radiation energy equation 



[D /aftTn 1 D /IV 



3X 



(7.31) 



simultaneously. This approach is sometimes called the two-temperature de- 
scription of a radiating fluid. As before, we linearize with respect to ST and 
(5 Tr at We now get a block tridiagonal system with (2 x 2) blocks. 

This system can be solved by Gaussian elimination to give the temperature 
corrections, and these results are used in the nonlinear system to iterate to 
consistency. 

In the discussion above we have assumed that x ^~E (hence the 
frequency-dependent information required to compute them) are given. How 
should we proceed in reality? 



- The simplest approximation for nongrey material is to take x = Xr 

/ce Kp, evaluating both xr and Kp at the material temperature T. This 
approach is reasonable if radiation is not too far out of equilibrium with 
the material. This is the usual approach, 

— To do anything better we need to know the profile eo(i^o)- An idea suggested 
by [10] is to take the two-temperature picture literally and define the two- 
temperature means 



ke(T,Tr) = f 
Jo 



Ko{iyo:T)B{uo,TK)duQ 



/j: 



Biuo^TB^di/Q 



and 



1 

x(T,Tr) 



rcc ^ 




di/Q 

Tr 


Jo Xo{i^o,T) 


dT 


' 


[05(1/0, Tr)1 

Jo [ dT J 


di/Q 

Tr 



(7.32) 



(7-33) 



The basic idea here is to get something we can actually compute, while 
still allowing the spectrum of the radiation (characterized by Tr) to depart 
from Bi,{Tm 2 LteTi&\)^ for example in the case of hot radiation penetrating cold 
material) . 
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To improve the treatment further, we would have to solve the monochro- 
matic diffusion equation. Let us consider this task briefly. The monochromatic 
diffusion equation is complicated by the presence of the (9/9//o) operator, 
whose presence implies that the equation is a partial differential equation 
in (r, t,z/o) instead of just (r, t). How should we solve this equation? Note 
that the term with (9/5 z/q) is in conservative form. It will therefore van- 
ish when integrated over all frequency. Hence it cannot affect the integrated 
diffusion equation. So why not just drop {d/di/o)^ This is a bad idea. A sim- 
ple gedankenexperiment shows why: Consider an adiabatic enclosure of an 
opaque, homogeneous medium (which means that the radiant flux will be ex- 
actly zero) with only Thomson scattering {k = 0;grey). Then the monochro- 
matic diffusion equation reduces to: 

DlnEo(z/o) 4Dln^ 1 d[uoEo{i'o)]D\ng __ 

Bt 3 Dt Eo{uo) duo Dt ~ ^ ‘ ^ 

Now drop the term containing (dfduo). We get 



Dln£JoK) 4Dln^l_^ 

~Dt 3^="’ 

which implies that Eq{uo) oc This result is correct for the integrated 
radiation in an adiabatic enclosure, but is wrong for monochromatic radia- 
tion. The point is that if the volume of the enclosure decreases, then Tr rises 
because a which implies that Tr a Because the radiation 

temperature changes, the spectrum of the radiation B(i/OjTr) necessarily 
must change. 



- Therefore, unless we retain {djduo) we get the wrong monochromatic spec- 
tral profile e{uo) = E^/E because we have ignored the adiabatic red/blue 
shift of the radiation when the material expands /contracts. 

- But the whole point of doing the monochromatic problem is to get the 
correct spectrum! 

- So, in short, we have no choice but to solve the full equation. Direct solution 
may be too costly, therefore we examine an iteration scheme. 

Let us discretize the frequency variable with G groups (or ODF [opacity 
distribution function] pickets [22]). Then we must solve G diffusion equations 



Dt 







d c(47rr^)^ dEg 

dMr _ ^iXg/Q dMr_ 



-h 



4nKp^gBg - CKgEg 

Q 



along with the material energy equation 



De D 
Dt 






(7.36) 



(7.37) 
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Here Hg and xr ,5 ^re appropriate direct and harmonic averages. Possible 
choices are: (a) ODF with kg = /Cpicketi and xr,^ = Xpicket- (b) Group means 
kg = and xr ,5 = Xr,p? computed at the material temperature T. (c) 
Two-temperature group means as in (7.32) and (7.33). Among these choices 
(a) is probably best; (b) is the most commonly used, and (c) may never have 
been tried. 

The Ag in (7.36) are discrete representations of the integral of d{uE^)!dv 
over {ug^ ^g+\)' In writing them, one must remember that if the material (ex- 
pands I contracts), photons are (red | blue) shifted. For numerical accuracy 
and stability we use upstream differencing. For example, for expansion, pho- 
tons come into group g from group pH-1. Recall that this frequency-derivative 
term is conservative, and its finite difference representation telescopes to zero 
when summed over the entire spectrum. 

As before we assume the material properties are all /(p, T) [or /(p, T, Tr)], 
and that we are using explicit hydrodynamics. Then at the goal is to 
determine a solution vector 

- (FJi , £ 2 , . . . , r)rf, (d = 1, . . . , D) (7.38) 

which gives the spectral distribution of the radiation field, and the local 
material temperature. As before, we linearize the equations and get a system 
of block-tridiagonal equations. Each block is of dimension (G H- 1) x (G -h 1). 
Therefore the computing effort for a direct solution scales as D{G H- 1)^, 
and the process is iterated to consistency. If the linear system leading to 
(7.38) is solved by iterative methods, considerable speed-ups are possible. An 
alternative is to use an iteration method ba^ed on physical considerations. 

A popular iteration procedure is the multifrequency /grey method (MFG) first 
introduced in the VERA (Variable Eddington Radiation Approximation) [11] 
code. The basic idea is to split the problem into steps: compute T and Tr at 
using frequency-integrated equations, assuming that the spectral profiles 
needed to compute the necessary mean opacities are given. Then use the 
multigroup equations with T and Tr given to calculate the spectral profiles 
needed by the opacity weighting procedure. Thus: 

- Use known profiles from t'^ to evaluate x ace- 

- Solve the gas energy equation and integrated radiation energy equation to 
get (T,Tr)"+i. 

- Using these temperatures, solve the multigroup radiation equations. Move 
all Ag terms (i.e., the frequency derivatives) to the right-hand side. We 
then have G uncoupled equations. Solve these equations in parallel. Using 
these results, evaluate new estimates of the Ag terms. Then re-solve the 
uncoupled system. Iterate to convergence to obtain the spectral profiles at 

- Using the new spectral profiles, update the estimates of x smd ke at 

- Then do the timestep {t^,t^~^^) over (possibly iterating to convergence), 
or kiss it goodbye and take the next timestep. 
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For astronomical problems MFG usual works well. But for very hard prob- 
lems, e.g., with material interfaces or large departures from equilibrium, it 
may not. 

Critique of Earlier Treatments of Diffusion. The correct nonequilibrium dif- 
fusion equation was first written by John Castor [4]. Before his Lagrangian 
transfer equation was known, it was not possible to derive the correct equa- 
tions rigorously, and most people made (incorrect) guesses from heuristic 
arguments. 

Consider a typical approach: start from the Eulerian radiation and mo- 
mentum equations, and then: 

1. compute F as if it were Fq (which it is not); and 

2. substitute this expression into the energy equation, dropping all 0(u/c) 
terms on the right-hand side to get: 

^ = K(4nB -cE)-V- Ve) . (7.39) 

The problem is that this equation is neither Eulerian nor Lagrangian. Indeed, 
the fact is that it is just wrong: 

— Although the right-hand side looks like our Lagrangian diffusion equation 

(7.24), the left-hand side is hopeless in that frame. It ignores the radiation 
work term and fails to discriminate between (^) and ^^(^)- 

So it is not Lagrangian. 

— Likewise it isn’t Eulerian because, comparing to (4.11), on the right-hand 
side we are missing a term kv ■ F/c equal to the rate of work done by 
the radiation force, and in the flux divergence, the dominant term that 
discriminates between F and Fq (i.e., ^vEq) is omitted. Actually it is 
tricky to write down a correct Eulerian diffusion equation. 

No one knows how many calculations could have been ruined by using (7.39). 
Moral: be careful of framel 



7.3 The Problem of Flux Limiting 

Everyone knows that diffusion theory applies only when Xp/i 1. But radi- 
ation hydro with radiation transport is very hard, while radiation hydro with 
radiation diffusion is relatively easy. So people have tried to push radiation 
diffusion codes into optically thin regimes. The usual result is trouble. Why? 
Symptom: one finds | F diffusion | > cE, which implies that the signal speed 
is VE = \F\/E = icAp VF|/F ~ (Ap/3^)c ':$> c because Ap ^ £, which is 
physically absurd. Root of the problem: (linear) diffusion theory insists that 
a photon always travels one mean-free-path per timestep, even if Ap exceeds 
the free-flight length cAt. [Remember that the linear diffusion equation has 




232 D. Mihalas 



an infinite signal speed. In nonlinear diffusion the signal speed is bounded 
below c.] 

To overcome the problem, people use an ad hoc flux limiting procedure 
invented by [2]. The idea is to control the flux with an expression of the 
general form: 



-cVE 

(3/Ap) + \VE\!E • 



(7.40) 



Then when Ap -(cAp/3)VJ5, which is the usual diffusion result, 

whereas when Ap ^ cEn^ where n is chosen to point in the direc- 

tion of —VE. This kind of flux limiting procedure is certainly qualitatively 
reasonable; many other formulations have been suggested. But in the end, 
flux-limiting is a fixup. To really overcome the problem, one must retain the 
time-derivative, which becomes important on a radiation-flow timescale, in 
the radiation momentum equation. To gain insight, ignore all 0{vfc) terms: 



1 dF djfE) ^ Xp 
dt dz c ' 



(7.41) 

(7.42) 



Replace the time derivative (dF/dt) with a difference formula; for stability 
use a fully implicit backward Euler formula. Then 



_ J7n 

c^At ^ dz 




(7.43) 



which implies 









7 + 



dz 



7 

7 + 






(7.44) 



where 7 = 1/cAt. To understand (7.44), define 
1 _ 1111 



(7.45) 



which implies that the effective mean free path A is the harmonic mean of 
the free-flight length At and the optical mean free path Ap. Then 

(7.46) 

Equation (7.46) gives us an algebraic expression for , which we can use 
to eliminate the flux from the radiation energy equation. 

In opaque material, x ^ ^ hence A — ^ Ap and / — ^ 

Then F^^^ ~{cXp/S){dE^'^^ /dz) which is the correct diffusion limit. In 

transparent material Ap > At, A Xt,f 1, and F cE. Then F^^^ = 
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— Xt{dF^^^ Jdz) which implies that a change in the local flux over a 
timestep reflects the flux gradient over a photon free-flight length At. Now use 
(7.46) in the radiation energy equation to get a combined moment equation. 
Again replace the time derivative with an implicit time difference. We get 



+ 

oz 



dz 



c c dz 



In the limit of high opacity and/or long timesteps, /c/7 1, so 



(7.47) 



(7.48) 



which is just the quasistatic nonequilibrium diffusion equation. In the limit 
of low opacity and/or short timesteps /c/7 1. Setting / = 1 we get 



^ 2 ^ 71+1 1 

- {cAtf = E^ 5 ^ 2B" - . (7.49) 

oz^ 7c dz 

To obtain the last expression we used the fully implicit representation of the 
radiation energy equation (not the combined moment equation) at Thus 



^ ^n+l _ 2E^ + ^ d“^ E 



(7.50) 



which is an approximation to the wave equation (mis-centered in time because 
all time differences were fully implicit instead of centered). 

In summary: 



- The flux limiting problem is an artifact of dropping ^ ^ in the radiation 
momentum equation. 

- This problem can be overcome if we solve the time-dependent moment 
equations (both), with accurate Eddington factors. (/ = | won’t do in the 
streaming limit; this would imply a wave speed of c/^/Sy not c) 

- While it is practical to overcome the problem in one-dimensional flow, flux 
limited diffusion remains the only realistic option in workhorse codes in 2D 
and 3D, at least at present. (An alternative is to use Monte Carlo, which 
will employ parallel processors.) 
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8. Shock Propagation: Numerical Methods 

We will sketch here how to follow the propagation of a shock numerically. 
References: [28, 32, 6, 12]. 

8.1 Acoustic Waves 



Consider the propagation of ID adiabatic acoustic waves, in planar geometry, 
with no external forces. The equations to be solved are: 
momentum conservation 



Du dp 

Dt dm 


(8.1) 


definition of velocity 




Dx 


(8.2) 


continuity 




1 dx 

^ g dm 


(8.3) 


energy conservation 




De BV 


(8.4) 



Discretize these equations: Choose a set of mass cells ruj and a set of 
times The ruj are Lagrangian coordinates. As shown in Fig. (7.1), as- 
sign thermodynamic variables to cell centers: 7 

Assign position markers Xj and velocities vj to cell surfaces. De- 
fine Amj^ij 2 = Then discrete representations of the differential 



equations are: 

(^n+l/2 _ ^ - p^_y^)/Amj , (8.5) 

, ( 8 . 6 ) 

= {x-tl - . (8.7) 

<iV2 - ^"+1/2 = - ^+ 1 / 2 ) • (8-8) 



HereZ\i”+*/2 = = \{At”~'^/^+Af'+^/'^)- Aruj = + 

^"lj + l/2)- 

Given initial conditions {ij}", {^j+ 1 / 2 }”, {ej+ 1 / 2 }", {pj+ 1 / 2 }", we 

can solve (8.5-8) in the order listed to find the solution one timestep later 
(t"+*). 
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To solve Eq. (8.8), one must specify the time-level for Pj+i/2- (a) Suppose 
the gas is perfect. Then p = (7 - l)pe, so we can eliminate p in terms of 
e. We can then use the time-centered value (expressed 

in terms of e), because then the only unknown in (8.8) is Then the 

equation is trivial to solve, (b) But more generally, p = p(^, e). We then have 
two choices: 

1) Use ^hich is not time-centered. But this gives us an explicit 

formula because only is unknown; again the equation is trivial to solve. 

2) Use the time-centered value |(Pj^i/2 “*"Pj+i/2) iterate to get con- 
sistency between ^j+i/2’ method is implicit 

The explicit formula is obviously cheaper. But as we see next, we must almost 
always use an implicit energy equation in order to assure numerical stability. 



8.2 Numerical Stability 

Experience shows that if we solve (8.5-8) with a sufficiently small timestep 
At, we get a smooth variation of the solution in time. But if At rises above 
some critical value, the solution abruptly develops unphysical oscillations that 
grow and swamp the true solution. That is, the solution becomes numerically 
unstable. We must therefore do a stability analysis of our finite difference 
scheme. We will sketch here the von Neumann local stability analysis which 
provides a necessary (and, sometimes, but not always, sufficient) criterion for 
stability. 

If = L{Xyt,y) where L is a linear difference operator on a uniform 
spacetime mesh Xj = j Ax, = n At, then an exact solution for y^ is 



k 



(8.9) 



The Ak^s are determined by initial conditions. Each harmonic k grows or 
decays independently of all others. If the solution is to be stable, no harmonic 
should become unbounded. Therefore we must demand that the modulus of 
the amplification factors ||^;fe|| < 1 for all k. 

Example: Advection. Consider the advection equation 



dg dg 

dt dx 



= 0 



( 8 . 10 ) 



where = const > 0. The solution of this equation is that the original 
density profile moves uniformly to the right with velocity vq, i.e., g{x,t) = 
f{x~vot) where / is an arbitrary function. Suppose we represent (8.10) with 
the difference equation: 



- Qj = - q^_i)/2Ax . 



( 8 . 11 ) 
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This is an explicit difference equation. Take a trial solution of the form: 

■ (8.12) 

Inserting (8.12) into the difference equation we find = 1—iasinkAx, where 
a = Vo At f Ax, The amplification factor is thus || ^fc|| == (1-ha^sin^ kAx)^^"^ > 
1. Therefore, the difference scheme is unconditionally unstable,, i.e., it is un- 
stable for any At > 0, no matter how small! Thus (8.11) is useless. So try 
another scheme: 

= -MqU - ■ (8-13) 

This equation is known as the leapfrog formula. Take the same trial solution 
as before. Now we find — 1 = 0 where = a sin kAx. Solving for 

we find 

^k = -il3±il-py^^ . (8.14) 

Thus if (3^ > l,||^ib|| > 1) and the method is unstable, whereas if < 1, 
II ^fcll = 1. Thus the leapfrog scheme is conditionally stable, i.e., it is stable if 
I sin A:zla: |< 1 for all k, which implies that we get stability if 

At < Axfvo . (8.15) 

This result is known as the Courant condition (or more correctly as the 
Courant~FHedrichs~Lewy condition, or CFL conditiori). 

Physically the CFL condition demands that the timestep be small enough 
that we don’t propagate the solution more than one cell- width per timestep. 
The Courant condition will appear again and again in what follows. The 
moral of this analysis is that: 

— A small change in the form of a difference equation can have major (perhaps 
catastrophic) consequences for stability. 

- One must always check a proposed scheme for stability. 

8.3 Systems of Equations 

The analysis above applies to one equation. Suppose we have several coupled 
equations: 

y'^ = L{x,t,y) (8.16) 

where L is a linear operator (matrix). Represent the solution by a Fourier 
series with amplitudes afc. One can show that (8.16) reduces to: 

= G< (8.17) 

where Q{At, k) is called the amplification matrix. By analysis (see Richtmyer 
and Morton) one now finds that the necessary condition for stability is that 
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the spectral radius of G be bounded by 0{At). Thus if Ai , . . . , A*, . . . , Am are 
the eigenvalues of G, then we must have 



I Aj |< 1 -f- 0{At) 



(8.18) 



for all for all k, for 0 < < r. 

For example, consider again acoustic waves. To simplify the algebra 
rewrite the momentum equation as: 




Difference (8.19) as: 



= (^) (^) , (y,1u, - n- ./.) (8-20) 

and couple to the time difference of (8.7), 

- n"«/2) = (1^) («w ■ (8-2') 

Take trial solutions of the form 

= (^^)n+l/2gifcj4m (g 22) 

and 

(g 23) 

We then find the amplification matrix: 

(8.24) 

where a = 2At/ Am sink Am/2, and /3 = {a/V)a. Calculating eigenvalues we 
find 

A = i{(2-^)2±[(2-/?)2_4]'/"} . (8.25) 

Then if (3^ > 4, ||A|| > 1 which shows that the scheme is unstable, whereas if 
< 4, then A lies on the unit circle in the complex plane, and the scheme 
is stable. Therefore for stability we must have 

< ^^sin < 1 (8.26) 



for all k, which can be guaranteed if and only if 

^ Am Ax 

At < = — , 

ag a 



(8.27) 



that is, if and only if the Courant condition is satisfied for the sound speed 
(which is the relevant wave speed) in the fluid. 
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8.4 Implications of Shock Development 

- Recall that nonlinearities in adiabatic wave propagation inevitably produce 
shocks. 

- If we solve differential equations, we can, in principle, handle a shock either 
as a steep front on a scale Ap, or as a mathematical discontinuity (a kind 
of “internal boundary condition”). 

- But difference equations can’t handle discontinuities; the solution must be 
smooth on a scale Ax, or the difference quotients give garbage. 

- The equations written above have no dissipative terms. Therefore even if 
they could handle discontinuities, they would not yield an entropy increase 
behind the shock. 

- Suppose we put in a molecular viscosity and try to resolve the shock front 
by brute force. This approach is doomed. We would need to reduce Ax to 
0(Ap); i.e., by many orders of magnitude. And if Ax — y Ax/k, the Courant 
condition would imply that At At/k, so the amount of computing re- 
quired rises as A:^. This is hopeless! 

- If we treat the shock as a weak solution (i.e., a mathematical discontinuity) 
we can use the method of characteristics with shock finding and shock 
fitting. This procedure can be cumbersome and hard to generalize when 
additional physics (e.g., radiation) is included in the calculation. Other 
problem with this approach are shock detection and shock dissolution. 

Von Neumann and Richtmyer suggested a clever scheme that handles shocks 
automatically whenever and wherever they occur. The basic idea is to use 
an artificial viscosity (also called pseudoviscosity) which produces entropy 
like ordinary viscosity, but smoothes transitions in the flow over a few typical 
zones (scale ~ Ax) instead of a few mean free paths. 

We can write the ID equations for planar viscous flow exactly as: 

and 

where q is the energy input per unit mass from “external” sources (e.g., radia- 
tion) and Q is the equivalent “viscous pressure” 

4 dv 

(8.30) 

and 

/ 3 , 

M =M+ • 



(8.31) 
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An analytical study of shock structure shows that viscosity guarantees con- 
tinuous shocks. Therefore we are motivated to try to use the Von Neumann- 
Richtmyer scheme in a numerical scheme. A crucial question is: how do we 
choose the coefficient fj.'7 One option would be to use the formula for molec- 
ular viscosity, but with A replaced by £ ~ Ax. This would be a bad idea. 
There are two problems: 1) this choice of fi artificially reduces the Reynolds 
number everywhere in the flow, and thus gives a bad represention of the 
smooth part of the flow; 2) the thickness of a shock ~ A/(M^ - 1); therefore 
weak shocks will be smeared much more than strong shocks. This gives a bad 
representation of the flow. 

The brilliant idea of Von Neumann and Richtmyer is to use a nonlinear 
pseudoviscosity chosen such that it is as large as we like in shocks, but then 
drops quickly to zero outside the shock. They chose 




(8.32) 



with 






{ 



e^{Dg/Dt) 

0 



for (Dg/Dt) > 0 (compression) , 
for (Dg/Dt) < 0 (expansion) . 



(8.33) 



Note that [/x] = (g/cm^)(cm^/s) [£] = cm (length). In practice one chooses 

£ = kAz^ where Az is an average zone size and A; is a number of order unity. 
Thus, 



- The main virtue of this methods is that it works. 

- Q comes quickly into action when the gas is compressed (e.g., in shocks). 

- Q is small or zero away from shocks. Therefore it doesn’t affect the back- 
ground flow spuriously. 

- From experience we find that we get about the same thickness for all shocks 
regardless of Mi. For A; ~ 1, / ^ Az. 

- From an analysis of shock structure one can show that we get the correct 
connection between upstream and downstream flow conditions as predicted 
by the Rankine-Hugoniot relations provided Q 0 3^ {dv/dz) 0 in 
the upstream/downstream flow far from the shock. The Von Neumann- 
Richtmyer artificial viscosity meets this requirements; therefore it preserves 
the Rankine-Hugoniot relations. 



8.5 Implications of DiflTusive Energy Transport 

In a real gas there will always be diffusive energy transport such as thermal 
conduction or radiation diffusion. This fact has important consequences in 
numerics. Consider the problem of linear heat conduction: 
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OT _ d^T 
dt ^ dx^ 



First try an explicit formula: 

= (^) (^iVi - 27T + V) • 

From a Von Neumann stability analysis one finds 

= 1 + Q:(cosA:zi2: - 1) , 



(8.35) 



(8.36) 



(8.37) 



where a = 2Ata/Ax'^. One thus sees that < 1 for all a. To bound 
away from —1, we must have a < 1. Therefore the timestep must satisfy the 
condition: 

^ , (8.38) 

which is very restrictive. If we refine the x-grid by a factor of k, then we must 
reduce At by a factor of Therefore the computational effort rises as k^. 
This result is very bad; we must find a more stable scheme. So try an implicit 
scheme: 

T"+1 - r; = ia [0{S^T)^+^ + (1 - 0)((J2T)”] , 

where 



(8.39) 



{5^T)j = Tj+,-2T^ + Tj. 



and 



0 < 6 >< 1 . 



(8.40) 



(8.41) 



The value 0 corresponds to an explicit formula and 1 corresponds to a fully 
implicit formula (backward Euler). From a Von Neumann analysis we find: 



= 



1 — (1 — 0)a(l - cos kAx) 
1 + 6a{\ - cos kAx) 



(8.42) 



We see that is always < 1, and is a monotonic decreasing function of 
7 = a(l — coskAx). To bound the amplification factor away from —1 we 
must have 



-1 < 



1 - 2(1 - 6)a 
l + 2o 



(1 - 20)a < 1 . 



(8.43) 



Thus, for 0 < 0 < |, At < Ax^ /2a{l — 20), and for | < 0 < l^At is 
unrestricted^ which implies the scheme is unconditionally stable. The partic- 
ular choice 0=1 minimizes the truncation error; this is called the Crank- 
Nicholson scheme. Having chosen an implicit scheme, we must solve a linear 
tridiagonal system at 
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-a,T;_V + bjT;^+^ - = rj. (8.44) 

This computation is trivial, and worth the cost because there is no limit on 
At, 

Moral: in any realistic problem of interest, there will always be some kind 
of diffusion of energy. Therefore we will always use an implicit form for the 
energy equation,, even though we may choose to use an explicit form for the 
other equations. 



8.6 Illustrative Example 



Consider flow in a ID stratified medium (e.g., a star), using planar geometry 
with explicit hydrodynamics. As a Lagraingian variable we can use the column 
mass measured downward into the star 

/ oo 

Q{z')dz' . 

The differential equations to be solved are: 





Du 



dm 



ip + Q)-g 



(8.46) 





dm ’ 



De , D /I 

^ + (P + Q)p^^^ 



(8.47) 

(8.48) 

(8.49) 



The corresponding difference equations are: 






— 9+(Pd+l/2 Pd-ll2^^d+l/2 Q d-\/2) ! 


(8.50) 


where 




= ^(zimd_i/2 + Amd+1/2) 


(8.51) 


and 




/it" = l(Z\t"“i72 + /it"+i/2) . 


(8.52) 



In general At^~^^^^ ^ At^~^^^ because of the requirements of stability and 
accuracy. Therefore to center on the right hand side of the momentum 
equation define 
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^tn-V/ 2 ) ^ ^ 1 ^^n+l/2)j 

Zt 

= t^ + i(zl<"+^/2 _ ^(n-l/2) ^ (g 53) 

which amounts to taking the average of the midpoint of the old timestep with 
the midpoint of the new timestep, and also choose 



^n+A = \W-\ 



pZi /2 = Prf+1/2 + 



This is an extrapolation of p. For Q use 



(Pd+l /2 - Pa+1/2) 



(-Y 

VdVd- 



d+l/2 



(8.54) 



QZl/l = ki^(0d+i/2 + Pd+i/2)(vzr - ^rn 



n~lj2 n— 1/2\2 



(8.55) 



if ^ and otherwise zero. Note that Q is lagged in the momentum 

equation (this turns out to be unimportant). Remark: in implicit schemes it 
is not necessary to extrapolate p or lag Q. 

To find and we need boundary conditions. Some typical choices 

are: 



Inner: 

(a) uo+i = 0' Rigid core. 

(b) vd+1 = f{t)- Piston. 

Outer: 

(a) Free boundary; no force across first cell, 

(b) Transmitting (outbound waves pass through) 



dv dv 

gj = -(. + v)^=S(a + v) 

(c) Zero pressure. Pi + Qi = 0 which implies 



dv 

dm 



where Am\ = ^Am^f2 + rni, and m\ is the mass above the first cell. 



Having updated the velocities, we can compute 
1 



,n+l 



yn+1 _ 



^d+1/2 






Am, 



d+ 1/2 



(8.56) 

(8.57) 
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We can then evaluate Q^+1/2 • finally we can solve the energy equation 

<i'/, - «3+./2 + [5 (p 5+./2 + pXp) + (CV/2 - 

= [(1 - J = . (8.58) 

To complete the system, we must be given mechanical and caloric equations 
of state: p = p(^,T), e = e(^,T), or e == Assume the first alterna- 
tive. Linearize around some guess T*, i.e., take — ^d+1/2 <^^d+i/2* 

We obtain a tridiagonal system for (5Td_|_i/2, (d = 1 , . . . , T))* The solution is 
iterated to convergence. 

The stability analysis for this system is complicated (see Richtmyer and 
Morton). One finds that the timestep must obey the Courant condition out- 
side the shock. Inside the shock, one finds 

^ fQ^t CFL 5 (8.59) 

where 

/Q=y/2/2fcQ. (8.60) 

Trial calculations show that /q ~ | works. In practice we must also limit the 
timestep to assure accuracy. Therefore we demand that At be chosen so that 
I AT /T \< et,\ Ap/p |< Cp, and so on. It is the accuracy requirment that 
restricts the timestep in an implicit calculation where there is no stability 
restriction on At. 

To see how artificial viscosity works, see Fig. (8.1) which shows a shock 
with P2/P1 = 5 and 7 = 2. Panel (a) has kq = 2, and panel (b) has kq — 0. In 
the latter, the shock speed is wrong, and the density shows large oscillations. 
This is not a problem of numerical instability. Rather, it is the failure of the 
system to convert upstream directed momentum into heat, and a failure to 
increase entropy, both because there is no dissipative term in the equations. 
The downstream cells are acting like “macro molecules”, and there is no 
hope they will ever thermalize. This is a vivid illustration of why one must 
use artifical viscosity. (Or an equivalent mechanism.) 

Total Energy Check. From the differential equations of the system, one can 
derive 

^(e+ + + (8.61) 

Integrating over mass, we get an exact conservation law 

D /■, 1 2 

y (e + 2 ^ + 9z)dm 

= + qdm + (p + Q)d+iVd+i — (p + Q)iVi . 



( 8 . 62 ) 
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Fig. 8.1. Shock computed (a) 
with and (b) without artificial 
viscosity. Prom [28] by permission 



The first term is the rate of change of the total energy in the system. The 
second is the heat deposited. And the last two terms give the surface work 
terms. But the difference equations do not guarantee the above integral ex- 
actly. So in practice we compute a discretized version of the total energy as 
a check: 



^n+l ^ X] {®d+l/2^’^d+l/2 + 
d 

+ - ^+1 
= ^ ^'d+\/2^‘<^d+i/2 + \{Vdf^rnd 






^rna 



— = constant . (8.63) 



Here 






k=0 *- 






- zf) 



(8.64) 
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and 

^ . (8.65) 

A:=0 d 

Ideally, £ should be conserved to high accuracy. If it isn’t, one should reduce 
the timestep. 



9- Numerical Radiation Hydrodynamics 

Consider now the ID spherically symmetric flow of a single material, as might 
occur, for example, in a star. Use Lagrangian variables and coordinates. 



9.1 Radiating Fluid Energy and Momentum Equations 

All variables are measured in the comoving frame, so for simplicity drop the 
suffix “0”. Retain all 0{vjc) terms in the energy equation; drop all t?-de- 
pendent terms in the momentum equation except (D/Dt) (for flux limiting). 
Then the monochromatic energy equation (see Sect. 5.3) is 



= — 47tB, - cE^) - ’ 

Q dMr 

and the monochromatic momentum equation is 



1 DF^ 



+ 47rr^g 



d{fM , (3/.-1) 



c 



c^Dt ^ dMr r 

Define a sphericity factor as 

'"’-'I.—*- 

Then we can rewrite the momentum equation as 

1 DF. 47rr2^ d (Uq.E,} ^ 1 

c2 Dt dMr ■ 

Integrating over frequency, we get the total radiation energy equation 



(9.1) 



(9.2) 



(9.3) 



(9.4) 




246 D. Mihalas 



Dt 




= - {ilTKpB 
Q 



-) -(3/-1)- 
QJ QT] 



ckeE) — 



E 

d (4nr^F) 



dMr 



and the total radiation momentum equation 
1 DF 4Trr^gd{fqE) _ 1 

Here 




(9.5) 

(9.6) 

(9.7) 

(9.8) 

(9.9) 
(9.10) 



We can represent (9.5) and (9.6) as difference equations on a staggered grid. 
We locate /, q, «p, and ke at cell centers, and F and xf at cell boundaries. 
For the time differencing we use an implicit scheme for stability; a common 
choice, if we don’t care about following radiation fronts moving with u ~ c, 
is fully implicit (backward Euler). We then have a choice: we can use either 
two 1st order equations for E and F at or one 2nd order equation for E 
(eliminate F). The choice is made on the basis of computational convenience; 
the physics is unchanged once the equations are discretized. 

The radiation equations above are coupled to the standard material en- 
ergy and momentum equations: 



^ GMr . z ip + Q) XpF 

Dt 7-2 dMr QC ’ 

Dv GMr , 2 (p + Q) . IdifqE) 

^ + (p + <3)^ (^) ^e + (ckeE - 4TTKpB)lg , 



(9.11) 



(9.12) 



(9.13) 



where Q is the pseudoviscous pressure of Sect. 8.4. 
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9.2 Computational Strategy 

Overall Procedure. It is costly to try to solve directly the full angle-frequency 
dependent radiation equations coupled to the equations of hydrodynamics be- 
cause the dimensionality of the problem (= 4) is too large. So we look for 
an approximate technique. Notice that we need only the frequency-integrated 
moments to solve the material energy and momentum equations. Therefore 
we are led to try a splitting technique in which the spacetime evolution is han- 
dled separately from the angle-frequency information. The method described 
below is called the multifrequency /grey method. For ease of exposition, as- 
sume we use explicit hydro. 

1. Suppose we are given the solution at t”. Update velocities to (ma- 
terial momentum equation), hence radii and densities to t^^^. 

2. If we can assume that the spectral distributions {EulE) and {Fu/F) are 
known at t^"^^ (actually they are not\) we can calculate ke and xf, 
hence /ce = «e/kp and fcp = Xf/xr- Similarly, if we can assume that 
the angular distribution of the radiation field is known at (again, it 
isn’t), then we have /„ and q^, hence / and q. 

3. At we can now solve the gas energy equation plus the radiation en- 
ergy and momentum equations. We thus obtain the run of 

and This system is nonlinear because the material properties de- 

pend nonlinearly on ; to solve it we linearize and iterate to conver- 
gence, as described earlier. 

In principle, these steps give us the solution at the advanced time. But ac- 
tually they don’t because we assumed we already knew {Ej^/E), (F^/F), /i/, 
and qi, at when in fact we didn’t. Therefore at the advanced time level 
p+i we should now: 

4. Solve the angle-dependent transfer equation for the geometric factors 
fj, and qu using source-sink terms evaluated at [i.e., k„(T’^'^^), 

]. 

5. Then solve the frequency-dependent moment equations for the spectral 
profiles {Ej,/E) and (F,^/F). Use these updated profiles to update the 
information in /ce,Xf,^e, and A:f at 

6. Go to step 3, and iterate to consistency. 

— In step 4, we ignore frequency- coupling in order to reduce the dimension- 

ality of the problem to (r, /i) at a given (i/, t). From this step we derive the 
geometric factors qi^ which depend mainly on angular information 

about the radiation field. 

— In step 5, we dispose of angular information by using moments, and attempt 
to improve the frequency-dependent information contained in {Ei^/E) and 
[Fu/F) by accounting for the (p,r) coupling at a given t, assuming that 
the geometric factors [fi^^qv) are known. 
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- This procedure works fairly well because is only a ratio which is mainly 
geometry dependent, whereas {Eu/E) and {Fy/F) are only spectral pro- 
files^ which are insensitive to geometry. 

- Very few calculations have been done to this level of consistency. Most ei- 
ther ignore the frequency-coupling in the moments, and lag the Eddington 
factors, or they handle the frequency-coupling but assume | (multi- 
group diffusion). 



Spatial Differencing and Unresolved Fronts. It is straightforward to write 
difference formulae for the equations written above. We won’t produce them 
here; the reader can easily work them out. But consider one very difficult 
problem: how can we calculate a good value for xf at an interface? A seem- 
ingly reasonable value giving the correct optical depth increment between cell 
centers is 

_ ix/Q)d-\l2^rnd-ii2 + (x/g)d+i/2^»^d+i/2 
\e)d + /imd+i/2 

But trouble arises when we fail to resolve a front in the flow (a shock, ion- 
ization front, ...). The problem is that the opacity can vary as with 
12— 14 in regions where H and/or He get excited and/or ionize. There- 
fore even small changes in T between zones imply a huge change in x- This 
rapid variation in x makes it hard to choose the correct value at the inter- 
face, and the errors made can clobber the energy transport at the interface. 
Generally the result is disasterous. How can we fix this problem? R. Christy 
[6] found that we can get “reasonable” results using an energy-weighted har- 
monic mean: 



Gy," 



(9.15) 



This recipe has been very popular, but it is only ad hoc. The real solution 
to the problem is to use an adaptive grid. The problem can’t be solved by 
brute force with tiny Lagrangian zones. For example, the ionization zone in 
Cepheids and RR Lyrae envelopes is about 0.001 to 0.01 scale heights thick, 
and moves back and forth over 3-5 scale heights during a pulsation cycle. 
A prohibitive number of zones (and timesteps!) would be required to resolve 
the front. In contrast, an adaptive grid moves with the front and resolves it. 
This approach is the preferred technique. 

Method of Solution. For the explicit hydro scheme sketched above, at t^'^^ we 
get equations coupling , F^'^^ at d = 1, ... , D. Linearize the equa- 

tions around the current estimate of the solution. We get a block tridiagonal 
system which can be solved by Gaussian elimination, and iterated to con- 
vergence (Newton-Raphson procedure). In linearizing opacities we can write 
= kEi^p and xf = ^fXr and assume {kp^kp) remain fixed. Or (better) 
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we could calculate {dK^/dT) and (9 xf/ 5T) from {dKgjdT) and {dxgldT) 
in groups (or an ODF) given {EgjE) and {FgIF). 

If we do implicit hydro instead of explicit, then at we must solve 
for r, u, Q, T, E, and F. We then have a block tridiagonal system with (6 x 6) 
blocks, or (7 x 7) with an adaptive grid. In principle one can take large 
timesteps with an implicit system, but in practice accuracy requirements 
severely limit especially if there are unresolved fronts. Again an adaptive 
grid is the only solution. 



9.3 Energy Conservation 

As for a nonradiating fluid, we can write a total energy conservation theorem: 



Dt 



+ [47rr^?;(p + P -h 



^)dM. 

Q)] +[L] = j edMr , 



(9.16) 



where [A^] — Aouter boundary Aimier boundary* But as in Sect. 8.6, difference 
equations don’t guarantee conservation, so we use a discrete representation 
of the conservation theorem (9.16) to monitor the numerical accuracy of the 
solution. 



9.4 Formal Solution 

How do we update the Eddington factors? In the past the best that has been 
done is to use a static snapshot That is, we drop D/Dt and all velocity- 
dependent terms, and solve only the static spherical transfer equation. This 
procedure is probably all right in low-energy astrophysical applications where 
v/c I and t\ <^tp^ But in more dynamical situations we should perhaps 
try to do better. Write the comoving frame transfer equation as 



0^ , A 2 









u^\ 

du j 



Vu - 



cr c ut 






(9.17) 



— All terms in the curly bracket are 0(u/c). 

— All terms in the curly bracket vanish identically when integrated over (/x, z/) 

— The remaining terms describe transfer along straight rays in spherical ge- 
ometry, and account for the velocity-dependent rate of work done by radi- 
ation pressure. 
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These observations suggest that when solving transfer equations to get the 
angular distribution of the radiation field we can simply drop all terms in the 
curly bracket The result is called the model Lagrangian transfer equation^ 
which: 

- accounts for the major geometric effects that determine the angular distri- 
bution of the radiation, hence /i,, 

- yields the correct integrated radiation energy/momentum equations (in- 
cluding all necessary velocity-dependent terms) when integrated over (/x, u). 

The resulting problem is of the same dimensionality as a static snapshot: 
(r, p) at a given (z/, t). 

Prom the model transfer equation we can form symmetric/antisymmetric 
equations: 



(t. 

c Df \ 0 



= r}u- 



^ 2 (1 - dhv 



cr c Dt 



3v 



(9.18) 



and 



Is ( 7 ) = -X,K*O{vl0 . (9.19) 



We can either difference these equations along tangent rays: 

i M Q 2x ^ fi^Dlng 

Xt/ -h (1 - 3/i^) 



Q 
c Dt 



D (3u\ ^ OK _ 
Df ( e j 5s 



cr c Dt 



Ju 



and 



1 - -y /l 

c D< ^ 5s ~ ' 



(9.20) 



(9.21) 



or we can write them in conservative form and difference on the (r, p) mesh: 

i^(hL 

cDt\Q^ 



= r]v 



, , d{r‘^K) , 15[(1-m")/i.] 

+ — + i 

oMr r dpi 

, /1 o 2xV Dlnp] . 
+ 3m )^ ^ Ju 



(9.22) 



and 



pL Dhj/ 

c Di 



+ A'KQjl^ 



djr^j.) 

dMr 



•f 



Ju 



1_5 

^ r d/j, 



[m(m^ - 1)> 



= -iJ'XuK ■ 



(9.23) 
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9.5 Multigroup Equations 



Having determined the angular factors by the solution of the angle-dependent 
transfer equation, we next solve the monochromatic (multigroup) moment 
equations for the frequency-profile of the radiation field. We obtain appro- 
priate difference equations by differencing the monochromatic radiation en- 
ergy/momentum equations. There is one important technical point: in the 
term 




or J J 



(9.24) 



the effective “advection velocity” of photons in frequency space is 



1 _ Dl^ {3U - l)t; 

Dt ^ r ■ 

As in multigroup diffusion theory, we want to use upwind differencing, and 
must choose the direction of the frequency difference according to the sign 
of Du/Dt, Thus if Di//Dt > 0, photons are blue-shifted during a timestep, 
hence we couple group g to group ^ ~ 1; in contrast, if Di^/Dt < 0, photons 
are redshifted and we couple group g to group ^ -h 1. In diffusion theory the 
term in {3ft, — 1) = 0, so a/1 groups couple in the same direction (i.e., all to 
^ - 1 or 1). But in transport, the two terms compete, and the direction of 
coupling may change for different groups depending on the signs and relative 
sizes of the two terms. 



9.6 An Astrophysical Example 

Now let’s look briefly at a nice example of radiation hydro in action. 

9.6.1 Supernovae. Reference: S. Falk and D. Arnett [9]. 

- Type II supernova, M > Mq. Core exhausted to Fe. 

- Endothermic reactions trigger collapse to nuclear densities. 

- Infalling envelope bounces on core. Drives intense shock outward. 

- Total of ergs released into envelope on a timescale typical 

dynamical timescale td = R/a ~ 6 months. Outer envelope is blown away. 

The presupemova has the structure of a red giant: 

- core: contains 1-2 M^^r ^ 10^-10® cm. This core collapses to a neutron 
star or a black hole composed entirely of Fe. 

- mantle: contains a few Mq composed mainly of C, O, Ne, Mg, etc., with 
a radius of r ~ 10^^ cm. 

- envelope: composed of He (including He-burning shell). Perhaps containing 

H and a H-burning shell also, r ~ cm. 

- circumstellar shell: 0.01 M^^r ^ lO^^cm. This is the material transported 
outward in a stellar wind. It is important because it affects the supernova 
light curve. 





R 

Fig. 9.1. Supernova explosion in a 5M© supergiant with circumstellao: shell, (a) 
Light curve (is = i(s)/10®). (b) Velocity as a function of mass, (c) Temperature as 
a function of mass, (d) Density as a function of mass, (e) Density as a function of 
radius. From [9] by permission 
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Basic Scenario, 

- The shock emerging from the bounce is markedly supercritical 

- The shock drives an intense radiation front into the upstream material. 
In the shock precursor, the radiation pressure ';$> gas pressure, by about a 
factor of 100. 

- Therefore the shock is heavily mediated by radiation, and the velocity 
jump is quite small. The effects of radiative diffusion are so large that the 
calculation can proceed many timesteps with no artificial viscosity, without 
having stability trouble! 

- As it runs down the density gradient in the envelope, the shock strengthens, 
steepens, and accelerates. 

- An external observer is unaware of the shock until it reaches r ^ I a,t some 
wavelength. The first signal to emerge is a soft X-ray burst. The luminosity 
maximum in the visible occurs 4 to 20 days later! 

- The shock unloads and produces a blowoff moving at a few times 10^ 
km/sec, i.e., v/c ~ 10“^. 

- A strong viscous shock finally [penetrates the optically thin layers and 
heats the gas there to very high temperatures (10^-10^ K) and drives a 
hard X-ray burst. (Early calculations predicted, erroneously, a 7-ray burst!) 

Falk-Amett Calculation. 

- ID, spherical, LTE, grey. 

- Assumes radiation diffusion inside, and couples to transport equations (mo- 
ments) near the surface, but uses the Eddington approximation through- 
out. 

Results are shown in Fig. (9.1) for a model with a circumstellar shell. 

(a) Light curve. The computed light curve has a broad peak, like observed. 
This delay is the time td required for the radiation to diffuse out from 
the envelope (the X-ray burst is long over). 

(b) Velocity profile. The shock position is shown by tick marks. At late times 
the envelope expands with nearly constant velocity. 

(c) Temperature. The peak temperature occurs before peak luminosity. After 
the peak luminosity, the temperature decreases monotonically outward 
and in time. 

(d) , (e) Density profile: The striking feature is the thin, dense shell that forms 

after the peak luminosity and expands rapidly outward. The shell forms 
because photons leak out from the shock very efficiently, which implies 
no temperature rise behind the shock (“isothermal” shock). Therefore 
we get a gigantic density increase behind the shock; in fact, we get a 
density inversion. The latter may be Rayleigh-Taylor unstable] if so, the 
flow would fragment into blobs, which implies mixing and large turbulent 
velocities. Similar phenomena are observed in supernova remnants. 
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10. Adaptive-Grid Radiation Hydrodynamics 

Reference: [8]. 

We have seen that nonlinearities in hydrodynamics and radiation hydrody- 
namics imply that the development of “discontinuities” in the flow is in- 
evitable. Examples are shocks, contact discontinuities, dissociation/ionization 
fronts, and chemical reaction fronts. Physically these structures are not actu- 
ally discontinuities, but are regions of very steep gradients with characteristic 
scales 0(A) and 0(Ap), where A is a particle mean free path and Ap is a pho- 
ton mean free path, and the charactersitic scale-length in the flow, is much 
larger than either A or Ap. Propagation of these sharp transition regions poses 
insurmountable difficulties for finite-difference equations unless the grid spac- 
ing is Ax ~ A or Ap which is, of course, prohibitively fine. 

Thus, to obtain numerical solutions for problems with fronts, we are forced 
to consider alternative methods: 

1) We can abandon the differential equations of hydrodynamics, and replace 
them with integral conservation laws, and strict mathematical discontinuities 
imbedded in the flow. We thus obtain what are known as weak solutions of the 
partial differential equations. To make the method work we apply front fitting 
techniques using jump conditions appropriate to the front in question. 2) Or 
we can introduce artificial dissipative terms into the difference equations, 
chosen such that sharp fronts are smeared over a few typical (hydrodynamic) 
zone widths. An example of this technique is the von Neumann-Richtmyer 
artificial viscosity method. 



10.1 Front Fitting 

There are both advantages and disadvantages to front fitting. The two most 
important advantages are 1) that the front has zero thickness on the grid, 
and 2) that the Rankine-Hugoniot relations are guaranteed exactly for a pure 
hydrodynamic shock. The disadvantages of this method are: 1) the front can 
move many front-widths in a timestep, hence we must find the position of the 
front; further we must detect formation of new fronts, and we must discard 
old fronts that have dissipated. 2) The algorithm becomes progressively more 
cumbersome as fronts interact, and the structure in the flow increases. 3) This 
approach tacitly assumes steady flow through the front. 4) We do not actually 
know how to write jump conditions for all kinds of fronts. In some cases they 
cannot exist because of nonlocal interactions {e.g., by radiation) across the 
front. Finally, in astrophysical flows we typically have an enormous range of 
characteristic time and length scales. A few of these are summarized below: 
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LENGTH 


A 

Ap 

H 

R 

L 


particle mfp 
photon mfp 

scale height (exponential medium) 
gravitational scale (power law) 
size of medium (flat distribution) 




Ap/c 


photon flight (10“® sec) 




ijc 


radiation flow (msec-sec) 




iVKc 


photon diffusion (days-years) 


TIME 


tja 


fluid flow (minutes) 




RIa 


pulsation (hours-days) 




{R?IGMYI‘^ 


gravitational collapse (years) 




(GM^RL) 


Kelvin-Helmholtz contraction (Myears) 


1 


Mc^/e 


nuclear evolution (Gyears) 



Clearly we would need an excessive logial structure in the code to account 
accurately for all possible interactions on all these time/length scales. 

10.2 Artificial Dissipation 

von Neumann and Richmtmeyr [32] took the point of view that all the front- 
handling by the code should be automatic. Almost 50 years ago, they showed 
that the method works! Dissipative terms do detect and track fronts auto- 
matically. In addition, one can construct dissipative terms such that jump 
conditions are preserved across the front. On the other hand, the method 
cannot handle multiple scale-length cases well. For example, using artifical 
viscosity fronts that are “resolved” on a hydrodynamic scale may be under- 
resolved by orders of magnitude on some other physical scale. Thus in radia- 
tion hydrodynamics shock resolution of 3 to 4 /lx typical niay be perfectly all 
right hydrodynamically, but these zones may be optically thick. Hence radia- 
tion transport through these zones may be calculated completely incorrectly. 
Thus it is necessary to develop a fully automatic method that can resolve all 
structure in the flow. This can be done with an implicit adaptive grid which 
can move grid points into regions of steep gradients of the physical variables. 

10.3 The Adaptive Grid 

Definition. What is an adaptive grid? There are two ways of answering this 
question. 

1) It is a coordinate system. In hydrodynamics we can have three useful 
coordinate systems - Eulerian: (r,t) which is fixed in the laboratory frame; 
Lagrangian: (m, t) or (ro,t) which is fixed in the fluid frame; and Adaptive: 
(grid index k, t) which is attached to features in the flow, and may move 
with respect to both, the lab frame and fluid elements. It provides a mapping 
r{j,t) and m{j,t). 
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2) To implement an adaptive grid we must provide an algorithm which deter- 
mines how grid points are distributed over the domain as a function of time. 
This algorithm must detect, resolve, and track important flow features auto- 
matically. It must allow a huge dynamic range (10^-10®) in cell sizes (spacing 
of grid points) so that flow features are always resolved. It must move the 
gridpoints such that flow features are almost stationary with respect to the 
grid (important!) even though they may move rapidly with respect to the 
fluid and in real space. If we can achieve these desiderata, we can use very 
large timesteps, and still satisfy both the CFL condition and accuracy cri- 
teria with respect to the adaptive grid. An adaptive grid becomes powerful 
when we incorporate it into the set of radiation hydro equations to be solved, 
and solve for the positions of gridpoints implicitly at the advanced time along 
with the hydro variables. 

The implicit adaptive grid just described is quite different from rezoning 
schemes, much in the same sense that implicit hydro schemes differ from ex- 
plicit schemes. In a rezoning scheme we rearrange the mesh after a timestep, 
in such a way as to reduce gradients measured with respect to grid index. 
One problem always faced is interpolation on the new grid; this process may 
be noisy. If we use implicit hydro we can use small zones and still beat the 
CFL condition, but the problem of getting the Newton-Raphson procedure 
to converge will set stringent limits because a narrow front may move over 
several zones if At is too large. And the most important problem is that by 
rearrangement of the grid after a timestep, we cannot guarantee that the 
high-resolution regions will be located correctly after the next timestep. An 
implicit adaptive grid addresses both of these difficulties. Here both the fluid 
variables and the spatial positions of the gridpoints are regarded as unknowns 
at Thus we must determine both simultaneously at the advanced time, 
i.e., we move the mesh during the timestep fully consitently with the motion 
of flow features. In this way we can achieve the goal of near stationarity of 
the flow with respect to the grid. 

Adaptive- Grid Conservation Laws. In radiation hydrodynamics one always 
needs to distinguish between the frame in which the physical variables (de- 
pendent variables) are measured, and the coordinates chosen for the indepen- 
dent variables. We are always free to choose a new coordinate system if we 
know the transformation between the two sets of coordinates. We shall dis- 
tinguish carefully between the Eulerian time-derivative d/dt, the Lagrangian 
time-derivative D/Di, and the adaptive time-derivative d/di. Then the fluid 
velocity is w Dr/Di, where r is the position of a definite fluid element, 
the grid velocity is Hgrid = dr/di, where r is the position of an adaptive 
gridpoint, and the relative velocity is Urei = w — it grid measures the velocity 
of the fluid with respect to the moving adaptive grid. 

With these definitions one can derive several basic connections between 
the adaptive coordinate system and the usual Eulerian and Lagranagian sys- 
tems. For example, it is easy to show that the standard formula 
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D/ df 

Dt ~ ^ 

is exax:tly equivalent to 
d/ df 
dt ~ dt 



( 10 . 1 ) 



( 10 . 2 ) 



Likewise, by following through exactly the argument that leads to the Euler 
expansion formula 



^(lnJ/) = V«, 



(10.3) 



when the coordinate system is anchored in the fluid so that u is the fluid 
velocity and 

dFfluid = Jf dK°„i„ , (10.4) 

one can derive the adaptive grid expansion formula 



d^ «7) — V ‘ ttgrid J 



(10.5) 



where now the coordinate system is the adaptive grid, which moves with 
velocity Ugrid with respect to the laboratory frame. Similarly, the Reynolds 
transport theorem 

d 



dt 



(Lr)=L[s 



+ V • («/) 



dV7 



can be generalized to the adaptive grid transport theorem 






If +V-(«gnd/) 



dV . 



(10.6) 



(10.7) 



A moment’s thought shows that all we did is change markers in the flow from 
bits of matter moving with the fluid velocity n to a logical grid moving with 
velocity tigrid* 

These theorems allow us to integrate over a control volume on the adaptive 
grid, and thus to obtain integral conservation laws on the adaptive grid. 
For example, the continuity equation, Euler’s equation of motion, and the 
(adiabatic) gas energy equation become: 

^ (^1^ gdV^ + I^^QUr,i dS = 0, (10.8) 

(p«)«rei • ds -I- J pn dS = j gfdV , (10.9) 

and 

(pe)«rel J ■ «)dV' = 0 . (10.10) 

These equations can then be differenced directly on the adaptive grid. 
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The Grid Equation. There are basically two adaptive grids which have been 
used in astrophysics. The first was developed by Karl-Heinz Winkler for 
his remarkable code WH80s [34]. The WH80s grid equation admits of some 
deep physical interpretation, and is probably the most robust and powerful 
(i.e., highest resolution) of all schemes. On the other hand, successful man- 
agement of the grid requires considerable experience and sophistication on 
the part of the user. A somewhat simpler and easier-to-implement scheme 
was suggested by Dorfi and Drury (DD) [8]. 

DD define a grid concentration giving the number of gridpoints per typical 
length 

Hi = ^ . ( 10.11 

Xi-^l - Xi 

Here Xi is a natural length such as a scale height or the size of the domain. 
Next they define the monitor function 



Ri^ 



1 + 71 ? 



-I 1/2 






( 10 . 12 ) 



Here Sij provides a measure of the slope of a graph of some physical vari- 
able {e.g.j m, g, T^p, e, E, Q)y and Wj is a weight factor of order unity. For 
logarithmic resolution^ which is useful, we can define 

Sij = ~ . (10.13) 

T Jij 

The basic idea of the adaptive grid is to put a lot of gridpoints, i.e., make 
Hi large, wherever there are steep structures in the flow, i.e., wherever Ri is 
large. Thus we could demand 



ni Tli — 1 

Ri Ri-i ’ 



(10.14) 



which, with suitable boundary conditions provides the extra equation needed 
to determine the grid. However (10,14) is not satisfactory in practice because 
we need to keep the grid smooth^ i.e., keep adjacent zones at roughly the same 
size. Otherwise we will get unacceptably large spatial discretization errors. 
So in practice we replace m with the diffused quantity 



fii = ni- a{a + l)(nj_|_i - 2ui + n^.i) , 



(10.15) 



which forces n* to fulfill the criterion 
a -f 1 ~ n* “ a 



(10,16) 



For a = 2, (2/3) < (ui+i/rii) < (3/2), i.e., there can be about a ±40% 

change in cell size. 
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In addition, one can put in a time filter which prevents the grid from 
trying to readjust on unphysically short time scales. If At is the timestep, 
and if Tg is the minimum response time desired for the grid, then we take 

ni = fii+ (10.17) 

This formula assures that when TgfAt 0 (i.e., long timesteps) we permit 
immediate readjustment of the grid. Whereas when Tg/At -> oo (i.e., short 
timesteps) we keep the grid frozen at the old value. A reasonable choice for 
Tg is about 0.01 times the characteristic timescale of the problem. Ji Tg is too 
large, the motion of the grid will be jerky. 

The final form of the grid equation is 



hi _ hi-i 

Ri Ri-i 



(10.18) 



This grid allows a total range in cell size of about 10^, which is adequate 
to handle many (but not all) important physical problems. The final grid 
equation (10.18) is linearized along with the radiation hydro equations in the 
Newton-Raphson procedure used to solve the system. Thus the gridpoints 
move consistently in response to changes of the physical solution during the 
iteration procedure, and at convergence are situated so as to best satisfy both 
the fluid and grid equations. The mathematical structure of the system is a 
sparse matrix, consisting of a pentadiagonal set of (7 x 7) blocks. These can 
be solved in a number of ways. 



10.4 The TITAN Code 

The TITAN code is a one-dimensional adaptive-grid radiation hydrodynam- 
ics code intended for astrophysical problems with a broad variety of ini- 
tial and boundary conditions. It employs the adaptive grid generation de- 
veloped by Dorfi and Drury [8] and else follows the basic philosphy of the 
WH80s code. TITAN comes with a suite of test problems that cover purely 
hydrodynamical and radiation hydrodynamical calculations as well as time 
dependent radiation transport through static media. A reference manual 
and a user’s guide as well as the code itself can be found under the URL 
http://141 . 142.3. 131/Pubs/TechReports/TechReportsSer ies.html of 
the National Center for Computing Applications of the University of Illinois 
at Urbana-Champaign. 
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Radiation Hydrodynamics: 

Numerical Aspects and Applications 

Ernst A. Dorfi 

Institut fiir Astronomie, Universitat Wien, Turkenschanzstr. 17, A-1180 Wien, Aus- 
tria 



1. Introduction 

Since electromagnetic radiation measured at the earth is the main source of 
information about astronomical objects (beside lunar rocks, meteorites and 
neutrinos) the interaction between matter and the radiation field plays a 
fundamental role for our understanding of the universe. From the intensity, 
the spectral distribution and the shape of individual spectral lines we can 
deduce basic quantities like temperatures, the degree of ionization, elemental 
abundances, velocity fields, as well as gravitational acceleration. Clearly, a 
large amount of theoretical concepts as well as laboratory data are needed to 
extract this information on the physical states from the photons that we can 
detect at telescopes. 

In the following sections I want to set the “scene” for some typical astro- 
physical applications where a different numerical strategy is necessary, i.e., 
simulations over very large time scales covering also many spatial magnitudes. 

1.1 General Remarks on the Numerical Method 

Concerning the other lectures held during this course I want to emphasize 
some general points on the numerical technique. First, in contrast to the other 
speakers the method is implicit, i.e., the solution at the subsequent time level 
cannnot be calculated by some sort of extrapolation from the physical values 
given at the old time levels. Therefore the time step is not restricted by the 
Courant-Friedrichs-Levy (CFL) condition (Courant et al. 1920). Second, the 
method is based on a finite volume discretization ensuring the conservation 
properties of the hyperbolic equations. Third, the method does not involve 
any Riemann- solver since the inclusion of radiation, diffusion, nuclear or 
chemical reactions, opacities, equations of state of non-perfect gases, etc., can 
lead to very complicated shock structures which destroy the nice features of 
analytical shock solutions. Fourth, shock fronts axe treated by an artificial 
viscosity but the amount of viscosity can be reduced at least by a factor of 
10^ since the adaptive grid clusters around such sharp transitions. Fifthly, the 
individual grid points are freely moving within the computational domain and 
their locations are determined by the so-called grid equation. This additional 
equation is simultaneously solved together with the physical equations. Up to 
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now only a one-dimensional version is available and the last section discusses 
some applications of multi-dimensional adaptive computations. 



1.2 Time Scales 

It is almost typical for astrophysical problems that the time scales character- 
izing the physical interactions can differ by many orders of magnitude which 
is closely linked to the computational aspects. Usually we are interested in 
the evolution lasting over many short time events like travelling waves, light 
fronts, convective eddy turn-around times, etc. Nevertheless, the transport 
phenomena connected with these motions influence the long term behaviour 
and we have to be careful in computing such evolutionary sequences without 
missing the essential physics. 

Taking a stellar object with a typical dimension a luminosity L, a 
mass of M*, the mass Mnuc available for nuclear burning as well as a mean 
density p, we can illustrate some of the aforementioned problems, i.e.. 
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where Cg denotes the mean sound velocity, k the mean opacity, £^therm the 
total thermal energy and e the conversion efficiency. For a 1 Mq main se- 
quence stars these time scales vary between 10^ and 10^^ s. As stated before 
the time scale we are interested in is much longer than the shortest one, e.g., 
stars evolve on the nuclear time scale Tnuc but are disturbed by numerous 
“short” -time events like pulsations (typically on Tdyn) or thermally induced 
secular changes on nherm- In many cases such different time scales are ac- 
companied by rather stiff differential equations and every numerical scheme 
has to deal with these complications. 



1.3 Length Scales 

Beside the temporal changes of different physical processes the spatial varia- 
tions of gradients, waves, etc., are small compared to the typical dimensions. 
Again, the appropriate resolution at these regions of action is essential to get 
a correct answer to the problem. An accretion shock on top of a protostellar 
core may serve as a relevant example since for a large fraction of time during 
the collapse phase the luminosity (and its spectral appearance) is determined 
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by the details of this radiating transition zone. During a protostellar collapse 
the matter is compressed by a factor of 10^^, the temperature raises by a 
factor of 10® and within such a flow we find an almost freely falling envelope 
enclosing a quasi-hydrostatic core separated by this strong accretion shock. 

Almost all dynamical events are associated with supersonic flows leading 
to shock waves. Hence, taking Ip mean free path for photons and Im mean free 
path for molecules we can estimate some typical length scales Ax involved in 
these transitions for a Mach number M, 

A ; ^ • • 

Ax = Im -T7 ~^ — 7 » ^ VISCOUS shock wave , 

— I 

Ax 2 :^ Ip d y in a supercritical shock wave . 

This shock thickness Ax is usually very small compared to the size d of the 
astrophysical object. In computing such flows we have to develop a numeri- 
cal technique which “recognizes” and follows the formation of characteristic 
structures. 

Since the gravitational force is relevant in many astrophysical events the 
Poisson equation has to be solved for those cases. In particular, self-gravity is 
the driving mechanism for the formation of all complex large scale structures. 
This requires the solution of the Poisson equation to calculate the gravita- 
tional potential, i.e., an elliptical equation defined as a typical boundary value 
problem. Clearly, an implicit method is required to do this task. Concerning 
the length scales introduced by self-gravity we can define the pressure scale 
height Hp within a spherical hydrostatic configuration and find for typical 
astrophysical situations 

i.e., very steep gradients can be encountered. Within a stellar atmosphere 
density and pressure vary on a scale of Hp and to get the radiative boundary 
correct we have to resolve these gradients. Taking a typical star the pressure 
ratio between the center and the atmosphere is of the order of 10^®, the 
density contrast is about 10^® and the opacity can vary up to a factor of 10^^. 
Since at the main sequence a normal star remains in dynamical as well as in 
thermal equilibrium the inner and outer values are closely linked together by 
the boundary conditions which have to be simultaneously fulfilled. 

1.4 Interaction Between Matter and Radiation 

The interaction between matter and radiation leads to a momentum and 
energy exchange which can be non-local, coupling regions far apart. The 
importance of an accurate treatment of this interaction is easily illustrated 
by a number of astrophysical situations. Radiation drives stellar mass loss 
by acting on stellar atmospheres. Radiative pressure imposes the so-called 
Eddington limit which defines the maximum luminosity to allow a static 
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structure. Heating as well as cooling by photons are examples of the energy 
exchange between matter and radiation often accompanied with ionization 
fronts. The different versions as well as the derivation of the radiative transfer 
equations describing all these effects can be found in the lectures by Mihalas 
and are therefore omitted here. Stellar pulsations are caused by appropriate 
opacity changes, this so-called ^-mechanism uses part of the radiative flux 
through the star to support cyclic pulsations of the stellar envelope. Again, 
the details of this radiative interaction as well as the transport of photons 
determine the variability, e.g., the shape of light curves, colour changes and 
the amplitudes. 

A further complication inherent to radiation hydrodynamics comes from 
the frequency dependence of the above mentioned processes adding a new 
dimension of complexity to the problems. Without going into details, a certain 
averaging or integration over frequency is used to calculate the radiative force 
or heating (see lectures by Mihalas). In most applications Rosseland-means, 
/cr, or Planck-means, kr, are used, i.e., 

I dB^ ^ ^ , 
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which are nowadays available as tables in electronic form through the OPAL 
or OP data. OP data can be accessed interactively through: 
http : //vizier . u-strasb . f r /OP . html 



1.5 Moving Fronts 

Radiation fronts such as ionization zones in pulsating stars or expanding HII- 
regions are neither fixed in mass (Lagrangian description) nor in position 
(Eulerian description). Therefore, an accurate tracking of such features must 
relay on a numerical technique which tracks a number of grid points to such 
fronts. We will call such a method an adaptive grid. It redistributes a given 
number of grid points according to user-prescribed needs. 

Shock fronts can travel over large spatial distances clearly seen from the 
evolution of a supernova remnant. During the so-called Sedov phase of a 
strong point explosion the shock radius Rs obeys a self-similar expansion law 
(with n dimensionality of the problem), 

i?s OC <2/(2+n) . 

Depending on the chosen reference system, shock fronts can be almost sta- 
tionary within a flow but their strength varies in time, e.g., the accretion 
shocks due to different rates of infalling matter, the terminal shock of a stel- 
lar wind acting on the surrounding medium or a reverse shock propagating 
through the expanding clumpy SN-ejecta. 
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Radiative shock fronts have a more complex structure than ordinary hy- 
drodynamical shocks which can be described by a simple discontinuity. Ra- 
diation from the heated post-shock region can leave towards the up-stream 
region leading to a radiative precursor. The compression at the shock front 
may cause variations in the equation of state by opening new degrees of free- 
dom in a non-perfect gas or by triggering chemical reactions. Finally, the flow 
through the down-stream region can contain several relaxation layers where 
excited states, heated electrons, etc., need some time (or distance) to find to 
their equilibrium configurations. 

To summarize this brief introduction we point out that the inclusion of 
“realistic” physics leads to a large diversity of time and length scales, steep 
gradients as well as very complex structures moving through the medium. 
Based on these physical needs I present - at least in one spatial dimension - an 
implicit numerical method capable of treating such astrophysical problems. 



2. Basic Equations 

This section describes the general physical as well as some mathematical back- 
ground of radiation hydrodynamics without a formal derivation or lengthy 
discussion of the underlying equations. 



2.1 Radiation Hydrodynamics (RHD) 



The dynamical behaviour of radiation and matter is contained in the equa- 
tions of radiation hydrodynamics (RHD) which I state here in a simple form. 
Assuming small non-relativistic velocities, an Eulerian (i.e., fixed) coordinate 
system I refer to the standard literature for a formal derivation or general- 
izations to more complex situations (cf., Mihalas & Mihalas 1984). In order 
to demonstrate the numerical technique I can restrict the equations to their 
one-dimensional, spherical version. For the radiation terms I will take a LTEi- 
source function without scattering. Denoting the gas density by p, the radial 
velocity by u, the gas pressure by P and the internal gas energy by e, as well 
as the mass integrated up to the radius r by m. We obtain the first part of 
equations describing the gas : 

Equation of continuity (mass conservation): 



dp 1 n 

dt ^ dr 

Equation of motion (momentum conservation): 
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Equation of gas energy law of thermodynamics): 



( 2 . 1 ) 



( 2 . 2 ) 
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dpe 1 dr^ pen 1 u 

dt 9r dr 

-h 4:7rp{KjJ - KsS) - p{enuc-^^Q) = 0 , (2.3) 

where I have also introduced the terms containing the interaction with the 
radiation field, i.e., kh,j,s defined through Eqs. (2.5) as well as the nuclear 
energy production rate Cnuc* Details on these quantities will follow in the 
next section and the symbols uq and eg are the artificial viscous pressure 
and the energy generation, respectively (see Sect. 3.12). 

The effect of self-gravity is included by the Poisson equation in spherical 
symmetry. Instead of computing the gravitational potential we can calculate 
the mass m within the radius r entering directly the equation of motion (2.2): 

Poisson equation (self- gravity): 

m= f inpr^^dr' . (2.4) 

Jo 

The simplest way of taking into account the radiation field with a specific 
intensity /,/(r , n, t) can be achieved by integrating the radiative transfer equa- 
tion over frequency and solid angle di? = sin OdOdu again assuming spherical 
symmetry (cf., lectures by Mihalas), i.e., 

{J,H,K}Jr,t) = i (2.5) 



where /x = cos^ has been used. 

After some algebra and together with integrating over frequency ly we end 
up with the so-called grey moments of the radial specific intensity Ii,{r^p^t) 
Radiation energy equation (0^^ moment): 

IdJ 1 Ju 1 dr^ H ^ ^ ^ ^ 
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c r 

Radiation flux equation (1^^ moment): 



I dH 
c dt 



-h 

-I- 



1 1 dr^ Hu ^ 

c dr dr 

3K~J l^ldr^u 

1 

r c r^ or 



-h pK\iH = 0 . 



(2.7) 



As in any moment technique we are confronted with a closure problem 
since we have only two equations for the three moments, the mean specific 
intensity Crad — 47 t/c J, the radial radiative flux Frad = 47rif and the radi- 
ation pressure Prad — A closure condition is required to specify a 

further relation between the radiative moments; this condition is discussed 
in Sect. 2.3. 
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2.2 Coupling Terms 

Due to the momentum and energy exchange between matter and radiation the 
coupling terms enter the radiation hydrodynamicaJ equations with different 
weightings of the frequency-dependent opacity e.g., 

rOO ^ pOO 

= y H = J (2.8) 

In this example the radiative flux has to be known a priori to perform the 
integration (2.8). In many physical situations we have to iterate between the 
two equations (e.g., stellar atmospheres). However, there exist certain limits 
so that we can substitute these weightings by the Planck- or Rosseland means. 
For these means the integration can be performed once before any specific 
application. Such data are in principle available by tables (e.g., OPAL by 
Rogers & Iglesias 1992) or OP-project by Seaton et al. 1994). Nevertheless, in 
almost all RHD computations we set kh,j,s = since only /cr is tabulated 
(OPAL, OP) for different chemical mixtures. Physically it would be better if 
one could take the appropriate limits, i.e., /cr = and kj,s = «p- 



2.3 Closure Condition 



As seen before it is still necessary to close the radiative moment equations. 
This can be achieved by either calculating the full radiation field or much 
easier by applying the much simpler Eddington approximation. First, the 
Eddington factor (see lectures by Mihalas) is defined through 

^ (2.9) 

Jl/ 



as the ratio between the zeroth and second moment (cf., 2.5). Knowing this 
ratio obviously closes the set of radiative moments. 

Assuming a nearly isotropic radiation fleld at a point r, the specific inten- 
sity /j/(r, /i) is expanded up to the first power of the azimuthal angle 6 with 

fi = COS0, 

Iu{r,fj) = a^{r) +K{r)fi . ( 2 . 10 ) 

According to the expressions (2.5) we calculate the corresponding moments 
{J,H,K]Jr) = K(r) 

= 5a„(r-)} (2.11) 



and get therefore simply 



^ K.jr) ^ 1 
Mr) 3 • 



( 2 . 12 ) 



In the Ccise of linear anisotropy in (jl we can use / = 1/3 for our computations. 
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Another useful description of the Eddington factor is based on the work of 
Lucy (1971, 1976) where the radiation field of an extended static atmosphere 
in spherical symmetry is treated by a geometrical approximation. Denoting 
the stellar radius by , the radiation field at radius r is approximated by the 
two intensities and I~ which are discontinuous at the attenuation angle 

//, = v/(l - y^) , and ?/ = - . (2.13) 

r 

Hence, adopting the Heaviside function i/(x) = 1 for x < 1 and H{x) = 0 
for X < 0, the intensity is written as 

h{r,n) = i/(/i-/i.)/+ + . (2.14) 

Calculating moments (2.5) for such an intensity one can derive 

+ = (2.15) 

which reduces to Ju = 3 at the stellar surface r = when = 0. Follow- 
ing Lucy (1971), the moment equations can be solved for a static atmosphere 
yielding 

J, = H,[l)[W + ^f], (2.16) 

H, = HM)y\ (2.17) 



where a dilution factor W and a modified optical depth r are introduced 
through 
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Combining Eqs. (2.16), (2.15) the Eddington factor (2.9) is obtained 
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Applying the definition of /i* in Eq. (2.13) the last expression reads 



(2.18) 



(2.19) 
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having the limit of /», = 1 for large distances t/ 0, i.e., 1 and f ^ 0. 

Note that the derivative of ft, is discontinuous at the stellar photosphere, i.e., 
r and from the last expressions one finds dft,/dr — oo for y ^ 1. 

The shortcomings of such a simplification are discussed in more detail 
in Sect. 3.14, where also the technique is outlined how the static radiative 
transfer equation can be solved by a method based on characteristics. Nev- 
ertheless, for a first attempt to do radiation hydrodynamical computations 
it is strongly suggested to start with the / = 1/3 which avoids additional 
complications and reduces the CPU-time by about a factor of two compared 
to a full solution including the Eddington factor. 
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2.4 Opacity 

The stellar opacities are of fundamental importance for radiation hydrody- 
namical computations* Only recently the quality of these data has been im- 
proved and several long standing problems related to pulsational instabilities 
have been solved (see, e.g., reviews by Gautschy & Saio 1995, 1996). Based 
on very time-consuming quantum mechanical computations the Rosseland 
means for different chemical compositions (X, F, Z) (where X, Y, and Z are 
the mass fraction of Hydrogen, Helium, and heavier elements, respectively) 
are available at prescribed points of density and temperature, usually for 
R = p/Tq and Tq = T/10® K. Hence, we have to interpolate acr = kr(p,T) 
or acr = k>r{R, T) for our purposes and we need also smooth derivatives with 
respect to density and temperature in order to obtain rapid convergence in 
the Newton-Raphson iteration. The later is needed as a consequence of the 
imlicit formulation of our scheme (cf.. Sect. 3.4). 




Fig. 2.1. Part of a typicztl Rosseland opzu:ity k r used in radiation hydrodynEunics 
as a function of temperature T and R = p/Tq with Tq — T/10® K. This figure 
is a combination of two different sources. The full lines represent low temperature 
opacities from Alexander et ah, while the dashed lines show OPAL data (see text 
for details) 
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Figure 2.1 shows the low- temperature part of the Rosseland mean /cr 
of the opacity as a function of temperature 10000 > T/[K] > 2000 and 
R = p/Tq, The chemical composition corresponds to X = 0.7, Y — 0.28 
and Z = 0.02. This figure demonstrates a typical problem encountered in 
RHD computations, namely, that different sources have to be combined to 
one opacity table. The OPAL opacities are available for temperatures above 
6000 K. The low temperature part is due to Alexander et al. (1995) with a 
upper limit at 10 000 K but down to 1000 K including also molecular opaci- 
ties, In the overlapping region both tables are tabulated on different T, p and 
R values and have therefore to be interpolated. Although the differences are 
small one has to be carefully in order to avoid further artificial edges when 
joining such tables. In particular the low-temperature region is important to 
determine the atmospheric structure. In the case of extended or dynamical 
atmospheres, opacities influence, e.g., the shape of the light curves, the forma- 
tion of dust particles, the strength of the radiative force, and the density and 
temperature stratification in this transition region to the interstellar medium. 
Note, that these combined tables are then used for further smooth interpola- 
tions within the given data points as outlined in Sect. 5.1. Since these tables 
depend strongly on the chemical composition in stellar evolution calculations 
a set of different tables is needed to take into account the chemical evolution 
of the matter. 



2.5 Equation of State 



The thermodynamical properties of the fluid axe determined by the equation 
of state (EOS). In most astrophysical applications we cannot assume a perfect 
gas since ionization of atoms, dissociation of molecules, as well as degeneracy 
of electrons are usually encountered in radiating stellar plasm 2 LS. Hence, we 
have either to calculate the thermodynamic variables for every set of input 
parameters or we can utilize an EOS table which is less costly than direct 
computations. In Fig. 2.2 one of the thermodynamic adiabatic indices, namely 



p ^ _ d\nP 

P dp d\np ' 



( 2 . 21 ) 



is plotted against the temperature for a constant value of p = 10“^ gcm~^. 
This particular EOS was calculated by Wuchterl (1990) containing H, H 2 
and He with their thermodynamic properties and solar metal abundances. 
The graph of A reveals a number of prominent thermodynamic features and 
can serve as a test for the numerical evaluation of the derivatives used in 
relation (2.21). As discussed in Sect. 5.1 the interpolation is performed via 
rational splines which show some small oscillations in the vicinity of steep 
gradients and which can reduce the convergence rate during the Newton- 
iteration (cf., Sect. 5.3). 
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Fig. 2.2. The adiabatic index Fi plotted against temperature at a constant density 
of p = 10“® gcm”^ depicts the various thermodynamical effects entering a typiccil 
equation of state (see text for more details) 



In Fig. 2.2 the two horizontal dashed lines indicate the values of A =5/3 
and A = 4/3, respectively. The vertical lines are plotted to separate the var- 
ious thermodynamical eflfects entering this equation of state. In particular, 
at lower temperatures the properties of H 2 are important for stellar collapse 
calculations since the dissociation of H 2 is responsible for the so-called second 
collapse after which a hydrostatic core is formed (see also Sect. 6.3 for more 
details). At temperatures up to a few 100 K we find a perfect gas without in- 
ternal degrees of freedom, i.e., A = 5/3. Moving up to higher temperatures, 
the rotational and subsequently the vibrational excitation of H 2 reduces A 
significantly which is followed by the minimum of A caused by the disso- 
ciation of molecular hydrogen. Around 10 000 K the ionization of hydrogen 
starts leading again to a strong decrease of A* Within this ionization feature 
of hydrogen also the ionization zone of He I is located. Increasing the tem- 
perature helium becomes fully ionized as it is clearly seen in the figure. The 
ionization of heavier elements around 10^ K produces a small decrease from 
A =5/3 but up to a few million A remains constant until the degeneracy 
of electrons begins to push the adiabatic index against the ultra-relativistic 
limit of A =4/3. 
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It is clear that most thermodynamical features are accompanied by 
changes in the opacity tables. Notice that the numerical constants and the 
details of the thermodynamics used for the opacities need not be identical to 
those applied for the equation of state if they come from different sources. 
Hence, special care is necessary to check whether the energies at which these 
changes occur are exactly at the same values in both set of tables. Such a 
discrepancy between the equation of state and the opacity would lead to a 
number of unphysical effects. 

If nuclear reactions induce a change of the chemical composition a direct 
computation of the EOS might, however be unavoidable. Due to the conser- 
vative form of our RHD equations the EOS is needed at least in the form of 
two tables, namely P = P{p,e) and T = T(p, e). The density p by Eq. (2.1) 
and the specific internal energy e by Eq. (2.3) specify the thermodynamic 
basis in a compositionally homogeneous medium. 



2.6 Transport Theorem 

In order to formulate the physical equations on an adaptive grid we encounter 
the technical problem to restate the usual conservation relations of the ra- 
diating fluid on the moving coordinate system in such a way that they are 
still valid for an arbitrarily moving mesh and that they are computationally 
well-posed. To systematize notation we recognize three different time deriva- 
tives in an adaptive system. First, the Eulerian derivative djdt taken with 
respect to the fixed coordinates in space, secondly the Lagrangian deriva- 
tive D/Dt with respect a the moving fluid element, and thirdly the adaptive 
derivative d/dt taken with respect to the fixed values of the adaptive mesh. 
Note however that the physical quantities axe still the Eulerian variables. 

This change of coordinate systems can formally be understood by the 
Euler expansion formula. Taking a fluid element with an initial volume dVo 
and moving at velocity u we get the the volume at later time by 

dV = Jf dVo , and = Vu. (2.22) 

This expansion formula can be generalized to arbitrary motions by 

dV = Jg dl/o , with , (2.23) 

when d/dt is now the derivative with respect to a motion defined through 
the velocity is not defined through this equation (cf., 
Eq. (3.7) and Sect. 7.6). 
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In accordaince with the standard Reynolds transport theorem of fluid me- 
chanics we can obtain the so-called adaptive transport theorem (Winkler & 
Mihalas 1986) which is the formal basis of our adaptive conservative formu- 
lation of the RHD equations 



_d 

d^ 




[ %dV+ f dS 

Jv JdV 



(2,24) 



In Eq. (2.24) V denotes a definite volume corresponding to the fixed values of 
the adaptive mesh, dS is the outwards-pointing surface element of the surface 
dV , Note that setting = 0 or = u reveals standard Eulerian (fixed 

in space) or Lagrangian (fixed in flow) versions of the expansion formula. 



3. Solution Strategy 

The following sections describe the solution strategy of the RHD equations 
based on an implicit conservative discretization on an adaptive grid. The 
set of the discrete algebraic equations is then used throughout the compu- 
tations presented in the subsequent sections. Further improvements and new 
developments are discussed in Sect. 7.6. 

3.1 Integral Form of the RHD Equations 

Prom the work of Tscharnuter and Winkler (1979) it has become clear that a 
so-called compact form of the different terms is desirable to calculate radiat- 
ing flows under the influence of self-gravity. At that time a fully conservative 
formulation has been unknown and it has been difficult to find a stable dif- 
ference scheme. Following the results of Winkler and Norman (1986) and in 
the light of the other contributions presented in this volume it becomes clear 
that a finite volume discretization is essential to ensure physical solutions 
on a discrete space, in particular to obtain the right jump conditions across 
shock fronts and to get the correct wave speeds. Therefore, we have to refor- 
mulate the RHD equations to make them suitable for an adaptive grid based 
on this finite volume approach. 

To begin with we take a volume V contained within the surface dV and 
we integrate the equations over this time-dependent volume using the Gauss 
theorem to manipulate the divergence terms into flows across the spherical 
surfaces. 
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We end up with the following set of integrated RHD equations: 
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(3.1) 



(3.2) 



(3.3) 

(3.4) 



(3.5) 



(3.6) 



As mentioned in the previous section, additional relations between the 
physical quantities are required to close this system. We need the various 
frequency integrated and differently weighted opacities /cj(p, T), ks{p,T) and 
k>h(p, T), an equation of state P = P{p,e) and T = T{p,e) as well as the 
source function S = S{T) and the Eddington factor / — K/J. 

Due to the purely analytical manipulations the set of equations is equiv- 
alent to the standard notation as presented in Sect. 2.1. This integral for- 
mulation is used for the subsequent discretization and we can proceed by 
taking now the volume V to be between adjacent computational cells, i.e., 
the numerical method is based on so-called finite volumes. 
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3.2 Symbolic Notation 

We get the discrete version of the RHD equations by applying the rules which 
transform the differential operators into finite volume operators. Therefore, 
we have to define the various differences which will be used throughout the 
following sections for any physical quantity X defined at the grid point r/, i.e., 
Xi = X{ri). Note that also the position r/ is determined implicitly according 
to the grid equation (cf., Sect. 4.). 



symbol 


operator 


description 


SXi 


— — X^^^ 


temporal difference 


AXi 


= x\- Xi+[ 


spatial difference 


Xi 


= 0.5 {Xi -h X;_|_i) 


spatial mean 



3.3 Moving Coordinates 



Since the coordinates are not fixed in space we have to specify the relative 
velocity between the gas velocity and the grid motion. We use an arbi- 
trary coordinate system but do not transform the variables into this system, 
i.e., use an adaptive coordinate system but still Eulerian (or Lagrangian if 
the equations are formulated in Lagrangian coordinates) variables. We can 
now define the relative velocity by changes of the grid point position 



= u - = u - 



Sr 

Tt 



(3.7) 



According to the implicit nature of the grid equation the actual grid velocity 
-g gjygn only after convergence of the iteration at the new time level 
since the grid equation is solved simultaneously with the physical equations. 
Another definition of the grid velocity is presented in Sect. 7.6 but not im- 
plemented up to now in the current RHD-codes. 



3.4 Implicit Discretization 

In the following sections we introduce a difference scheme which allows a 
stable computation of radiating and self-gravitating flows, at least in one 
dimension. There exists no rigorous mathematical proof for the stability but 
in a large variety of examples we never encountered a numerical instability, 
in particular the adaptive mesh is very sensitive to such situations because 
a rapid oscillation of any variable will increase the clustering of grid points 
making the numerical difficulties even more visible. 

The unknown variables are defined on a so-called staggered mesh where 
scalars like density, pressure, energy densities are defined inside a computa- 
tional cell and vectors like velocity and fluxes are defined at the cell bound- 
aries (Fig. 3.1). Hence, we have to provide a description of how to transform 
the variables from the cell boundary to the cell’s interior and vice versa. Up 
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Fig. 3.1. Location of variables on a stag- 
gered mesh. Scalars are defined within a cell, 
vectors at the cell boundaries. The grid index 
I is increased inwaxds 



to now this is done by a simple mean between the left and right values and 
in the final version of the discrete RHD equations of Sect. 3.13 we will use 
this spatial mean for the gets momentum flux as well as for the radiation flux. 
A better but much more sophisticated way is presented in Sect. 7.6 which 
up to now has not been incorporated in any RHD code. Note, that a more 
complicated description rapidly increases the need for CPU~time for evaluat- 
ing the corresponding Jacobi-matrix (cf., Sect. 5.4). The whole computational 
domain is divided by N grid points and the grid index is running inwards, 
f.e., / = 1 defines the outer boundary at r\ = Rout whereas i = N gives the 
inner boundary located at = Rin- Note also that ri appears as a depen- 
dent variable since the location of an individual radius point is determined 
by the grid equation (Sect. 4.). 

Compared to the work of Tscharnuter and Winkler (1979), the key differ- 
ence to find the following conservative formulation comes from the fact that 
the implicit nature does not require a form of the equations like 



o 

— /)i + Vp/Wi = 0 (3.8) 

but any combination of pi with other variables is suited as long as the corre- 
sponding Jacobi-Matrix remains non-singular. 

An advantage of implicit schemes comes from the large time steps that can 
be used during the computations (see Sect. 5.3 for more details). All explicit 
numerical schemes like the ones described in the lectures by LeVeque and 
Muller have to obey for stability reasons the restrictive Courant-Friedrichs- 
Lewy (CFL) condition restated here in its simplest form for a grid with 
spacing of Ax, 



tcFL = 



min 

all cells 



Ax 

|u| + Cs ■ 



(3.9) 



The Courant number can be defined through 

A^cfl — (3.10) 

CCFL 

and any explicit scheme requires A^cfl < 1* 

With an implicit scheme, we have to deal with large nonlinear algebraic 
systems for the unknowns at the new time step. If Q denotes the discrete 
version of the differential equations one gets a system like 
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+g(X("),X<°)) =0 (3.11) 

and this system is typically nonlinear and has therefore to be solved itera- 
tively, e.g., by a Newton-Raphson iteration for the variables at the new 
time level. Usually the time step is not restricted but the convergence radius 
of the Newton iteration places some limits. Another condition on the time 
steps comes from accuracy considerations, e.g., the time step can be restricted 
to limit the allowed maiximum variation of variables as time advances. 



3.5 Time-centering 



The computation of oscillatory solutions requires special care for the temporal 
evolution, in particular the numerical damping in time has to be reduced. By 
separating the time derivative from the radial terms the system of equations 
is written symbolically as 

^ = m ( 3 . 12 ) 

and a Taylor expansion is performed to calculate the finite difference approx- 
imation. If Eq. (3.12) is substituted, 



h{t -h St) - h{t) 

It 






(3.13) 



it becomes clear that the temporal error scales with St. Repeating this pro- 
cedure for Ti one gets 



d n{t-\- St)- H{t) id^n 

dt^~ St ^ 2 dt^ 



(3.14) 



Now, substituting the last expression into Eq. (3.13) and collecting terms we 
end up with 



h{t -I- St) — h{t) 

It 



^[n{t + 6t) + n{t)] + 



id'^n 

4 



st^ + ... 



(3.15) 



and this expression goes with the second order of St. Hence, the accuracy 
order of the temporal discretization can be increased if the radial part of the 
equations is time-centered with 0 = 1/2, i.e.. 



(n(t)} = on{t -h St) -h (1 - e)nit ) . 



(3.16) 



If is a linear operator this is equivalent to center the physical variables like 



{X{t)) = 9 X{t + St) + {1 - 6) X{t) 

= + (1 O<0<1, (3.17) 

where 0 = 1 gives a fully implicit scheme. As demonstrated before the mean 
value of 0 = 0.5 leads to second order accuracy in time. However, in the 
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Ccise of nonlinear equations both procedures are not equivalent. Nevertheless, 
from numerical experience I prefer the time-centered variables and thus the 
code uses differential operator on time-centered variables rather than time- 
centered operators on physical variables. For stability reasons a typical value 
oi 6 — 0.51 may be necessary (see Feuchtinger & Dorfi 1995). 



3.6 Adaptive RHD Equations 



Based on the finite volume conservative discretization on an adaptive grid 
the basic form of all physical equations is given by 



dt 



L 



XAV 



+ 



L 



dA 



{Xs 



Asink) dV^ = 0 . 



(3.18) 



Depending on the physical nature of the source and sink terms their trans- 
formation onto the moving coordinates can be easily achieved by applying 
the adaptive transport theorem (cf., Sect. 2.6). The following sections explain 
in more detail how the different terms can be discretized in a stable man- 
ner. Since we are carrying the integrated mass m as an independent variable 
to obtain the gravitational acceleration we can use the temporal change of 
the integrated mass (divided by 4tv) to calculate the fluxes across the cell 
boundaries, 



Smi 

7T 







Sri 

It 



(3.19) 



where we have expanded all terms explicitly according to our time-centered 
description of Eq. (3.17) as well as our advection procedure as stated in 
Eq. (3.25). The derivation of the Jacobi-Matrix simplifies when taking Eq. 
(3.19) (instead of the direct use of r^^up^^u) and it has in general no influence 
on the numerical solution as long as the flow structure is well represented 
on the adaptive grid. An advantage of using Sm/St comes from keeping the 
discretization stencil 5-point for the momentum equation. The direct use of 
would result in a 6-point scheme due to the interpolation necessary 
on the staggered mesh. Note also that for clarity the explicit notation of the 
time-centered variables will be omitted in the following sections. 



3.7 Discretization of Gradients and Divergence Terms 



Starting with a gradient of the physical quantity X we can write the volume 
integrated values between the cell boundaries (/ -h 1,/) after a division with 
47T 



X 



VXdV 



I 



= rf /iX, , 



(3.20) 




3. Solution Strategy 281 



where the staggered mesh allows an intuitive physical interpretation since, 
e.g., a pressure difference Z\P between adjacent cells acts directly on the 
surface area r^. 

The volume integrated divergence is straight forward due to the Gauss 
theorem and therefore it is discretized as 



X 



VXdV 



= A{rfXi) 



(3.21) 



3.8 Diffusion 



Diffusion terms are not included in the basic set of RHD equations presented 
in Sect. 3.1 but clearly a number of astrophysical phenomena can be charac- 
terized by a diffusion process. Although modifying the nature of the differ- 
ential equations from a hyperbolic type to a second order derivative in space 
and by adding parabolic terms to the equations the implicit method can deal 
with such terms without major changes. In the case of diffusive terms, e.g., 
an explicit method suffers severely from the limits imposed on the time step 
by 



St < min 

all cells 



1 {AxY 
6 D 



(3.22) 



denoting the diffusion coefficient by D, i.e,, the time step has to be reduced 
by the minimum spatial resolution squared. Note that the exact factor in 
front of condition (3.22) depends on the discretization scheme used. Hence, 
a number of numerical methods employs an operator splitting technique (see 
also lectures by LeVeque) where the parabolic parts are solved by an implicit 
method still allowing the CFL time step of Eq. (3.9) for the hyperbolic parts 
of the equations. 

A particular example including diffusive terms is given in Sect. 6.1, where 
a few models are plotted describing the SNR-evolution with particle acceler- 
ation. In this case a diffusion term in the cosmic ray energy equation has to 
be added to the usual term of the energy equation. The following (volume 
integrated) diffusion term is discretized in spherical symmetry 



X 



VDVXdV 



i 



= A 




AXi ' 

Ari 



(3.23) 



and in more detail 

X, - Xf+i 
Ari Ti - ri+i 



(3.24) 



since the scalar quantity Xi is located inside a cell whereeis the radii ri and 
rj+i are defined at the cell boundary. I emphasise this point because taking, 
e.g., the mean value (Xi) defined at the “correct” position of ri can lead to 
some numerical instabilities as described already by Tscharnuter and Winkler 
(1979). 
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3.9 Advection 



Any numerical scheme for hydrodynamics different from a Lagrangian has 
to deal with the transport of quantities from one computational cell to the 
neighbouring ones. The corresponding physical terms are called advection 
and their numerical properties play an important role also for adaptive con- 
servative computations where this transport is done with a relative velocity 
instead of the usual gas velocity. The conservation form ensures that any loss 
or gain in one cell due to this transport is compensated by the same amount 
of gain or loss in the neighbouring cell. As demonstrated in the lectures by 
LeVeque and Miiller these advection terms can be discretized, e.g., by em- 
ploying a Riemann solver or some approximations to the physical nonlinear 
flow structure. The resulting flux over a boundary is reconstructed from the 
different fluxes carried by the various waves. The complexity of this proce- 
dure depends critically on the EOS and on the wave speeds. In the case of 
radiating fluids the velocity of light can place rather strict conditions on the 
time step. Furthermore, the nature of the various waves becomes more com- 
plex. Hence, up to now only the simplest “wave” is taken into account to 
calculate the advection terms in radiation hydrodynamics: The direction of 
the flow is used to construct the flux. Therefore, the advection terms for the 
quantity X will be discretized in spherical geometry by 

[ dA ~ Zi ^ u"®') (3.25) 

JdV ^ ^ 

for stability reasons X has always to be taken from up-stream. Depending 
on the accuracy the advected quantity X^^ can be obtained from 



- constant slopes within a cell, donor-cell 

- linear monotonized slopes, monotonic advection 

- piecewise parabolic slopes, PPM 

- or some other high order method. 



Without reviewing the vast mathematical literature on these points I want 
to state that almost all examples presented here rely on an upwind monotonic 
advection since together with the adaptive grid such an advection scheme 
provides in most cases the necessary accuracy. Therefore the advection can 
be written as 






ad 









if u[^‘ < 0 
otherwise 



(3.26) 



together with the monotonized slopes 



X,*** = Xi + 



AXi AXi-i 

0.5(X,_i-X,+i) ’ 

0 , 



if AXi AXi^i > 0 
otherwise . 



(3.27) 



This version of monotonized slopes is based on the work of van Leer (1977), 
where a slope within a cell is only introduced when there is a clear “trend” 
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in the same direction within the neighbouring cells. In the case of developing 
numerical oscillations one encounters a typical wave length of the instability 
of one cell size and the above monotonicity criterion will cut these oscillations 
through replacing the slope by a constant, f.e., the advection is truncated to 
the stable first order donor-cell scheme. This procedure guarantees also non- 
oscillatory solutions in the vicinity of shock fronts as long as the grid provides 
enough resolution to evaluate the shock transition correctly. An example of 
a small post-shock oscillation is presented in Fig. 4.6 which was caused by 
insufficient spatial resolution. 

Prom counting the indices it is evident that depends on the three 
points /-!,/,/ + !. The flux at location r/ is calculated from and X^^^^ 
depending on the direction of the relative flow and finally, the changes of 
Xi within a cell are given by the difference between the left hand boundary 
situated at n+i and the right hand boundary at r/. Hence, collecting this 
information the whole advection term based on such a monotonic description 
depends on quantities with indices Z — 2,/ -!,/,/ -h 1,/ + 2. This so-called 
5-point stencil enters then directly into the structure of the corresponding 
Jacobi matrix (Sect. 5.4). 

Prom the statements above it is also clear that up to now the non- 
uniformity of the grid as well as the geometric properties have not been taken 
into account for calculating the monotonized slopes but further remarks on 
this subject can be found in Sect. 7.6. 

3.10 Initial Conditions 

Starting the implicit computation requires the full solution of the system of 
RHD equations together with an appropriate grid distribution. This initial 
model must lie within the convergence radius of the Newton iteration other- 
wise the computation cannot be started. Since a number of problems attacked 
by implicit methods are mixed boundary and initial-value problems the de- 
mands on initial models can be very severe and special care must be taken to 
obtain suitable initial models as discussed in more detail in Sect. 4.10, where 
the chosen examples illustrate very different initial conditions. 

In the case of a protostellar collapse the initial model is simple because 
constant values for density, pressure and zero velocity in a Jeans-unstable 
fragment can be assumed. The grid is distributed according to a constant 
point concentration n = const. As the collapse proceeds the structures de- 
velop together with the adaptive grid as discussed in more detail in Sect, 6.3. 
The creation of hydrostatic initial models requires more advanced methods 
involving in most cases the numerical integration of the stationary stellar 
structure equations. As demonstrated in Sect. 4.10 the grid adaption can 
be achieved by a pseudo-time evolution. Such a dust-free hydrostatic initial 
model is used to start a stellar wind calculation where after a while a dust 
driven wind develops from a carbon-rich asymptotic giant branch (AGB)- 
star. The outer boundary is kept as a Lagrangian boundary for some time, 
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then the whole structure expands and when the outer gas reaches its final 
escape velocity the boundary is changed to an outflow boundary condition. 
The details of these models are described in Sect. 6.4. 

To summarize: it can be complicated to create initial models appropri- 
ate for implicit computations. RHD-problems are in general mixed bound- 
ary/initial value problems and it can be difficult to advance the solution in 
time with time steps longer than the CFL time step (3.9). Small fluctua- 
tions in the initial conditions can generate various waves which must then be 
followed on such small time steps. 

3.11 Boundary Conditions 

The boundary conditions for the RHD equations have to be specified together 
with the corresponding boundary conditions for the adaptive grid which axe 
presented in Sect. 4.5. Usually, the boundary conditions can easily be incorpo- 
rated into the staggered mesh as seen in the next examples. I want to empha- 
size at this point that at both ends of the computational domain boundary 
conditions are supplied in a way as to keep the structure of the corresponding 
Jacobi-matrix as regular as possible. For the computations a 5-point stencil 
is adopted and therefore any boundary condition relating two points either to 
the left for the outer conditions or to right for the inner condition is included 
without changing the overall pattern of the Jacobi pentadiagonal block struc- 
ture (see Sect. 5.4). A periodic boundary condition joins the innermost to the 
outermost grid points leading to a different structure in the Jacobi matrix 
which modifies the standard inversion procedure of the pentadiagonal block 
matrix. 

Starting with a simple case of fixed values Abound for the variable X the 
boundary conditions read as follows 

Xi^N - Abound = 0 5 (3.28) 

where the index 1 denotes the outer boundary and N the inner boundary, 
respectively. Clearly, this equation leads to an entry of 1 in the corresponding 
Jacobi-matrix and no problems are expected for the subsequent inversion 
during the iteration. At the innermost point a regularity condition is often 
encountered yielding equations like 

= 0 , or Xn- ^iv-i = 0 , (3.29) 

inner 

where the second expression represents the discrete version of the regularity 
condition. The case of a divergence free outer boundary at can be realized 
by 

Xj^v^ — X]^ —\r\j — 0 . (3.30) 

An outer Lagrangian boundary requires the solution of the momentum 
balance and an adaptive grid moving at the same speed ujsj. The full discrete 



dr 
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equation of motion is given in Sect. 3.13 and the boundary conditions for 
the grid are described in Sect. 4.5, Taking an external pressure of Pext? a 
total mass M, no mass flow over the boundary and expanding all terms, the 
velocity uat at the boundary is given through 

S{un Pn^Vn) + S{rriN~i) uff-i + Str%{Pext ~ Pn~x) (3.31) 

47r6tGM 4n5t — — 

+ 2 Pn^Vn Pn^Vn • 

c 

This equation corresponds to the finite volume discretization of the momen- 
tum equation containing also self-gravity and radiation pressure. Note that 
the radiation flux i/yv has to be determined from the radiative boundary 
conditions as described in Sect. 3.15. 

From the examples above it should be clear how the various boundary 
conditions can be included in the discrete version of the RHD equations. If 
the boundary conditions are not correctly formulated, then in most cases the 
Jacobi matrix will become singular during the Newton iteration, or at least, 
the time step will remain rather small. 



3.12 Artificial Viscosity 



As outlined in the previous sections the present numerical scheme for RHD 
computations does not make use of any kind of Riemann solver. Therefore, 
in order to treat shock fronts a so-called artificial viscosity is used in order 
to broaden shock waves over a few computational cells. This idea has been 
proposed by von Neumann & Richtmyer (1950) by adding an additional pres- 
sure Q in the vicinity of a shock so that one gets the correct jump conditions 
when integrating over the discontinuity. Their original work is restricted to 
plane geometry in the case of other metrics we have to use the geometry- 
independent formulation developed by Tscharnuter and Winkler (1979). In- 
stead of introducing a viscous pressure Q they generalized the concept by the 
viscous pressure tensor which is formulated in analogy to a physical viscous 
pressure. Following their derivation with a traceless tensor - allowing homol- 
ogous contractions without pressure generation - the pressure tensor Q, the 
viscous momentum transfer uq, and the viscous energy dissipation cq are 
given by 



Q = 



UQ ^ 



PQ 



(Vu) 



1 



V-uI 



-VQ, 



-Q-(Vu) , 
P 



(3.32) 

(3.33) 

(3.34) 



where (Vu) is the symmetrized velocity grcidient and I the unity tensor. The 
artificial viscosity coefficient ixq consists of a linear and a quadratic term 
weighted by q\ and q 2 , i.e., 
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^lQ = -qilyiscCs + g2^viscmin(V u,0) . (3.35) 

The definition of /zq include a typical viscous length scale / vise- Inspecting 
the last expression one sees that the linear term in / vise scales with the sound 
velocity Cg and is always present when q\ 7 ^ 0. Such a term has been included 
in a number of explicit computations to damp out small scale oscillations 
near contact discontinuities. For all computations I have done so far, such 
oscillations never appeared and therefore I set =0. The second term is 
quadratic in /vise ^nd it is evident that compressive and non-homologous 
motions, i.e., V-u < 0 and tr(Q) ^ 0 produce a viscous pressure. The 
amount of artificial viscosity is then determined by these length scales qi I vise 
and g 2 /visc 3 Jid the following table summarizes the main difference between 
adaptive and non- adaptive artificial viscosity formulations, namely that the 
adaptive grid clusters around discontinuities which enables much lower values 
of these length scales. In non-adaptive computations the viscosity has to be 
of the order of the typical cell size Ax and q 2 is set to the number of cells over 
which the shock front is smeared out. In adaptive and implicit computations 
a length scale /vise can be defined over which a shock should be broadened 
and then the grid shrinks around the shock front to resolve the structure 
having this typical thickness of /vise- Hence, the thickness of the shock front 
can be prescribed a priori and does not relate to the resolution Ax available 
in non-adaptive computations. Note, that in explicit computations such a fine 
grid around shock fronts reduces the time step through the CFL-condition 
(Eq.(3.9)). 



grid type 


viscosity scale 


shock scale 


non-adaptive 

adaptive 


qiMyisc - 
9l,2^visc ^ 19 


grid size 
“physical scale” 



According to the discretization scheme the artificial viscosity term are 
written like 

MQ,< = qmcs,i - {q2nf min oj - (3.36) 

including the linear part with Cg,/, the adiabatic sound velocity at r/. The 
corresponding discrete and volume integrated versions of uq and eg are 
given in the next section. Clearly, the viscous momentum transfer uq enters 
the equation of motion and the viscous energy generation adds to the gas 
energy equation. The action and smallness of this artificial viscosity can be 
inferred from a shock tube calculation depicted in Fig. 4.2 or from the SNR 
evolution where the three discontinuities moving with the ejected material 
are presented at one year after the explosion in Fig. 6.1. 

3.13 Discrete RHD Equations 

Adopting the previous notation and including the artificial viscosity terms 
the full set of discrete RHD equations is given in symbolic notation as a set 
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of nonlinear algebraic equations for all basic variables like p, u, e, 
and r at the grid index L The time-centering as stated in Sect. 3.5 
omitted for clarity. 



SjpiAVi) 

St 



6{ui piAVi) 
St 



SjpieiAVi) 

St 



Ami 

1 SjJiAVi) 
c St 



1 S(HipiAVi) 
c St 



+ A(rfpf‘^ur^)=0, 

+ A(-^-^^)+rfAPi 



+ 

4* 
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+ 



St 
47rGm/ 



47T . 



piAVi piAVi 

c 







rnad' 

Pit 



+ A[rfHi) 



+ -K[A{rfui) - ~{3Ki - J,) ^ AV, 
c c ri 

+ i^Rjpi {Ji - Si) AVi = 0 , 

+ ^ + rfAKi + 

1 



-h krj Hi Pi AVi + ~ HirfAui = 0 . 



An appropriate volume definition of 

^ - n+i) 



J, i7, m 
has been 



(3.37) 



3^^{wj«(^-|)}=0. (3.38) 

PiA{rfui) + 47TKR,/ Pi {Ji - Si) AVi 

= 0 , 

PiAVi=0, 



(3.39) 

(3.40) 



(3.41) 

(3.42) 

(3.43) 



as well as the integrated continuity equation has been used, i.e., 



Smi 



p2 ad,,rel 



(3.44) 



Note that due to the staggered mesh, e.g., the mean mass between two adja- 
cent cell pAV = -h pi-\-iVi+i) as well as other me 2 in values have been 

used for the gas momentum equations (3.38) and the radiative flux equa- 
tion (3.42). This system has to be solved numerically together with the other 
physical relations such as P = P(p, e) or kr = kr(p, T) and the adaptive 
grid derived in Sect. 4. 
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3.14 Radiative Closure Condition 

The numerical treatment of radiative transfer requires some additional re- 
marks. The radiation field is described by the first moments as presented in 
Sect. 2.1 and the representation necessitates an additional relation, e.g., be- 
tween J and K to close the system. This so-called radiative closure condition 
via the Eddington factor has been discussed in Sect. 2.3. The following sec- 
tions are devoted to the calculation of this closure condition from a solution 
of the time-independent radiative transfer equation (see lectures by Mihalas) . 
As pointed out in the introduction, for a number of astrophysical problems 
the time scale to adjust the geometry of the radiation field is short compared 
with the outer physical time scales and therefore a time-independent version 
of the radiative transfer equation can be used to calculate the ratio between 
the zeroth and second moment. However, the method outlined in the next 
sections can be easily generalized to handle also the time-dependence of the 
Eddington factor. The specific intensity /i.(r, //) obeys then the static radia- 
tion transfer equation in spherical geometry with 6 angle between radius and 
line of sight (/i = cos 6) 

introducing also the source function Si^. Clearly, the full solution of this equa- 
tion increases the dimensionality of the problem because the intensity It, (r, /i) 
depends on the radius r and the angle cos0 = /x. Although the next formu- 
lae look rather complicated the numerical treatment is simple and mainly a 
book-keeping of the different variables. 

Since Eq. (3.45) is a linear partial differential equation for /^(r,/x), the 
theory of characteristics can be utilized to reduce and solve such an equation 
by integrating along their characteristics (e.g., Yorke 1977, Balluch 1988). 
The characteristics are light rays along a constant impact parameter given 
by 

p = = const. , (3.46) 

also plotted in Fig. 3.2. The ray with impact parameter pj cuts different radii 
n at varying angles pij (Fig. 3.2). If the inner boundary of the computation, 
i7in> is not located at the stellar center a number of additional impact pa- 
rameters have to be introduced inside the innermost sphere defined by R\n- 
Hence, the total number of impact parameters is K > N, where N is the total 
number of radial grid points. Clearly, the values of the impact parameters pj 
are defined through the radial values r/ to avoid further interpolations on the 
p-grid. 

Transforming from (r, ^)-grid to (r,p)-grid the intensity can be calculated 
by integrating along these characteristics 

I, = + r dr; , (3.47) 

Jo 
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Fig. 3.2. The impact parameters 
pj are chosen such that their corre- 
sponding rays are tangent to the ra- 
dial shells. They intersect the shells 
r/, at angles whose direction 
cosines are pij = cos Oij. 



where the optical depth r*/ has been introduced through 




(3.48) 



Since the parameter p cannot distinguish between /i < 0 and /i > 0 we have 
to define the outward and inward intensities 



itir, ^l) = Iv{r, ±/i) , M > 0 . (3.49) 

To arrive at a discrete version of the above integral we assume a linear 
dependence of the source function Sy within a computational cell and drop 
V for the sake of simplicity in the following. The inward and outward IJ^ 
at the grid point r/ read as follows 



n,i 


= e j ' S{t' + Ti+i) dr' , 












= 5(t/_i - r') dr' , 

Jo 


(3.50) 



where the different indices / + 1 or / - 1 are caused by the left or right starting 
point of the integration. The optical depths are defined along each pj-path 
determining fi and given by 

At^ = KiA{nr)i , 

At~ = Ki-iA{ij,r)i-i . (3.51) 

For the second part of expressions (3.50) the discrete version based on the 
linear interpolation of S{r) leads to 
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1 Atj' - 1 + e 

2 ^ 






+ * 5 ^ + 1 - 



Z\r • — 1 + e ^'^3 



Si—^ 






At; 



+ 5,_i (l - ) 



1 - e (1 + At, ) 



+ Si-, 






(3.52) 



(3.53) 



In the case of small optical depths At^ 1 the weighting factors for the 
source function appearing in the last expressions should be expanded to in- 
crease the numerical accuracy. 



3.15 Radiative Boundary Conditions 

Specifying the outer radiation field by Iu,ext as well as the central source 
both specific intensities can be obtained by integrating Eqs. (3.50). 
Therefore the following boundary conditions are needed 

•^^(n,p) = h,ext{p) , 0 < p < ri , 

li'iP,?) = C(P.P)> < P < ri , (3.54) 

J^{P,P) = C(P.P) + /‘>/(p) ) 0 < p < . 

The first row of (3.54) allows for every impact parameter pj to start the in- 
wards integration of I~ until the inner boundary is reached. This can happen 
at r = 0 for all impact parameter pj > ri . In this case the regularity condi- 
tion on the axis (Fig. 3.2 and second line of 3.54) demands that the in-going 
intensity /“ must equal the outgoing intensity . If the inner boundary is 
not placed at the center r — 0 the innermost impact parameters cut through 
the inner sphere, i.e., pj < ri and a central source hj, is added to the incom- 
ing radiative field (third row of 3.54). The case of an optically thick central 
source added into a non-grey inner boundary is discussed in Sect. 6.2, where 
a non-grey radiative transfer calculation is performed on top of grey RHD 
models to obtain U,B,V colours of RR Lyrae pulsations. 



3.16 Eddington Factor 

After having obtained all intensities I^{riypj) the angle- dependence of the 
radiation field can be eliminated by an integration over all impact parameters. 
As inferred from Fig. 3.2 a fixed radius n cuts various impact parameter 
under different angles pij and therefore the integration over the angle p can 
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be replaced by an integration over the impact parameter p. Recalling the 
definitions of the radiative moments (2.5) the discrete versions reads as a 
summation over all pj with I < j < K: 



Ju{ri) 



HM) 



KM) 



\ (J+(m) + d/i = 1 aj (/+ + IJ^) , (3.55) 

^ Jo ^ 

\ r (/+(/.) - /-(^)) ’ (3.56) 

*'0 ^ j=l 

1 ^ 

\ f (/-(^) + C(,x))M"dM = (3.57) 

^ Jo ^ 



where and IJ^ denote the specific intensities at {ri,pj). The weighting fac- 
tors of the integrals ajb, bk and Ck are calculated fi:om geometrical arguments 
and regularity conditions on and I refer to the original work of Yorke 
(1980) for their evaluation. Finally, the discrete version of the Eddington 
factor fi/ at the grid point ri is obtained, 

(3.58) 



For the solution of the radiative transfer equation the outer boundary 
condition for the radiation intensity is given at r = ri through 

J(ri) = Jext H" T [-ff ( 7 * 1 ) + ^ext] 7 (3.59) 

with 






JHn) ’ 



(3.60) 



the ratio of the outgoing radiative flux to the outgoing energy density 
J+. 



4. Adaptive Grids 

As stated in the introduction a number of astrophysical problems require the 
capability of dealing with physical variables which vary over many orders 
of magnitude. The occurrence of steep and propagating fronts like shock 
waves or ionization fronts led to the idea to redistribute a given number of 
grid points according to these numerical needs (Winkler 1975). In particular, 
the protostellar collapse calculations have triggered this development since 
the evolution of the accretion shock determines the optical appearance of 
the collapsing cloud (see also Sect. 6.3). In contrast to the large number of 
adaptive grids which are used in engineering or aerodynamics and which are 
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so-called boundary fitted, the adaptive grids we have in mind are variable in 
time and depend on the physical solution. They are not generated in advance 
but they evolve in time as the physical situation changes its nature. Hence, 
the location of an individual grid point depends on the solution and on the 
resolution available. However, we have to ensure that the number of grid 
points is sufficient to resolve all structures. 



4.1 Basic Grid Properties 



Although the grid points are freely moving several requirements have to be 
fulfilled throughout the entire computation: 

1. monotonicity, i.e., < ri{t) for all t, 1 < / < AT, 

2. smooth variations in space and time, 

3. concentration of points at “important” places, automatic relaxation of 
the grid to a regular spacing in the case of smooth structures. 



The derivation of the following grid equation is based on these three points 
and they are automatically incorporated. Next we define the point concen- 
tration at index I by 



ni ~ , 

Xl - 



(4.1) 



where T/ denotes a typical (local) scale. In the simplemost cases we use a 
grid equation like 



Ui = const. 



(4.2) 



and take, e.g.. A; = 6 - a if we want an equidistant grid on the interval 
[a, b]. To be more flexible we can write, e.g., —xj and create a logarithmic 
equidistant grid. Summarizing these two example we see that specifying A/ 
tailors the grid in the case where no structures are present. Hence, we can 
use A/ to prescribe the grid locations, e.g., more zones at a boundary, in the 
center, or wherever. 



4.2 Desired Resolution 

The so-called desired resolution IZ contains all information what should be 
resolved during the computation (e.g., density, opacity, shock waves, ioniza- 
tion zones) and clearly depends on the problem to be solved. In what follows 
it is essential that 7^ is a positive quantity. Taking these two building blocks 
we formulate the basic form of the grid equation 

n oalZ . (4.3) 

The constant of proportionality will be determined by the total number of 
points and the separation of the boundaries. In praxi we use the path-length 
of a graph of f{x), i.e., 




dx 



2 
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TZ a 



ii + 



(4.4) 



because this gives a natural concentration at locations of steep gradients and 
the points are distributed uniformly in arc-length along the graph of /, It is 
then obvious that the discrete case of M functions can be generalized to 



Hi oc 







M 
j = l 



- a;j+i / 






(4.5) 



where the scaling factors Fij as well as different weighting factors gj for 
the M physical quantities have been introduced. As an initial guess for the 
weighting factors gj it is recommended to take a “democratic” approach by 
setting gj = 1 for all j. The spatial scaling factor A/ is defined through 
Eq. (4.1). This leads to our basic form of the discrete adaptive grid which 
fulfills already conditions (1) and (3) 



ni oc Hi , (4.6) 

i.e., the points remain monotonic for single valued functions f{x) and are 
concentrated at steep gradients due to the large variations of df/dx. From 
the numerical solution it becomes evident that the grid should not vary too 
rapidly in space and time, e.g., neighbouring cells should have comparable 
dimensions or move at almost the same speed. 



4.3 Spatial and Temporal Smoothing 



The grid equation as stated in Eq. (4.6) reacts immediately to changes in 
the physical variables and allows very abrupt changes of the cell size. From 
numerical stability and smoothness we want to avoid such a rapid increase 
or decrease of the cell sizes, e.g., the mass contained in two neighbouring cell 
should be comparable. Hence, we will limit variation of the point concentra- 
tion through the ratio between two zones 



Ota 



ag-\-l 



< 



ni 



< 



an 



(4.7) 

n/4-i ag 

where ag is a free parameter controlling the maximum permitted variation in 
cell sizes. The values for this spatial smoothing are typically around ag — 2 
which allows a shrinking or expansion of the cell size by about 30%. The con- 
dition (4.7) can be transformed by replacing Hi through a spatially smooth 
quantity like 



ni oc 






an 



Qq -h 1 



K-il 



(4.8) 
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Inspecting the above equation we notice that any perturbation occurring at 
IZj will be “felt” at a grid point ri by a decreasing influence, namely by 
a reduction factor of [[oLgl[otg + Hence, the further away a point 

concentration occurs, the less this concentration influences the local point 
concentration. To proceed we can replace the sum on the rhs by a difference 
on the Ihs 

fii-ni- ag{ag -h 1) (n/_i - 2ui + ni^i) <x Hi . (4.9) 

This differential version is preferable since also the discrete physical equations 
are written in such a local way that only the most neighbouring grid points 
enter the finite differences. A formulation like Eq. (4.8) couples all grid points 
together and fills up our Jacobi-Matrix which has to be inverted at every time 
step (see Sect. 5.3). In all computation presented in the following sections we 
have used — 2 allowing typical grid variations around 30% when going 
from one cell to their neighbouring ones. 

The temporal smoothing can be done in a similar way by damping the 
grid motions over an exponential time scale 

Tl{t) = [ TZ{t - ^O^xp f— ^ — , (4.10) 

Jo \ '^9 J '^9 

or in a discrete differential version 

= *1"’ + ft H”’ 

where the integral operator in Eq. (4.10) hcis be replaced by the correspond- 
ing discrete differential operator. Note that the difference operator used in 
Eq. (4.11) is the first order approximation to the differential operator be- 
longing to the Greens function defined through Eq. (4.10). In many cases the 
computation runs smoothly with an instantaneous grid adaption, i.e., Tg ~ 0. 



4.4 Grid Equation 



Now we can collect the previous terms and eliminate the constant of propor- 
tionality by equating at two neighbouring grid points the ratio between the 
spatially as well as temporally smoothed point concentration of Eq. (4.10) and 
the desired resolution Eq. (4.5) yielding the final version of the grid equation, 
i.e., 



Hi Hi -I 



(4.12) 



Several remarks should be added concerning this grid equation. The constant 
of proportionality can be eliminated in numerous ways, e.g., by the inverse 
of Eq. (4.12) or by the product nH, etc. However, according to numerical ex- 
periments the version presented in Eq. (4.12) leads to the smallest number of 
iterations although the other elimination procedures work as well. Remem- 
bering the previous derivation of the grid equation we see that the spatial as 
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well as the temporal smoothness of the grid variation requires most of the 
manipulation on the point concentration and from the usage of a discrete 
diffusion operator acting in the space of indices (4.9) and the exponential 
damping (4.11) the necessary property of monotonicity is also ensured. It is 
also clear that there may be other possibilities of get rid of the constant of 
proportionality by writing, e.g., 

s (!)=»• ” 

Again, from numerical experience such formulations may work but difficulties 
arise in the case of shock reflections at rigid boundaries, or in more general 
situations where the number of features to be resolved by the grid equation 
is changing rapidly with time. 

Another remarks concerns the grid points involved in Eq. (4.12) because 
at every index I the two left and right neighbouring grid indices enter, i.e., 
we arrive at a 5-point stencil, (t - 2, i - 1, i, i -h 1, i + 2) yielding at least for 
the grid equation a pentadiagonal Jacobi matrix (cf.. Sect. 5.4). Hence, we 
have to specify boundary conditions for two grid points at both ends of our 
computational domain. 



4.5 Grid Boundary Conditions 

Taking N grid points (a;i > X 2 > . . . > xn) the boundary conditions can be 
stated by demanding a constant point concentration at the boundary 

ni —U 2 , and un-i =nN -2 • (4-14) 

The innermost and outermost mesh points xn and x\ can still move according 
to physical requirements, e.g., fixed in space by specifying the old position 
equal to the new position 

. (4.15) 

A Lagrangian boundary is constructed by calculating the flow velocity, e.g., 
u\ at the outer boundary 

= 0, i.e., -h 5i (ui) , (4.16) 

where we have used the time-centered velocity (ui). In a fully implicit version 
we simply take . Note that at a Lagrangian boundary the velocity u has 
to be determined from the equation of motion. A moving piston with velocity 
wn ai the inner boundary leads to another typical example, 

= x^^ -h StwN . (4.17) 

All such kinds of boundary conditions can be combined together so that we 
can have, e.g,, a outer boundary fixed in space and a moving piston at the 
inner boundary over vice versa. However, we have to take care of the physical 
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situation: We cannot specify for example an outflow boundary condition for 
the physical equations and force the grid to move in a Lagrangian way (see 
also Sect. 3.11). 

4.6 Grid Motion 

As seen in the adaptive RHD-equations (Sect. 3.6) the grid motion enters 
through the advection terms and the transport of all physical quantities across 
the cell boundaries scales with Hence, the overall motion of individual 
grid points should be limited to avoid advection errors and in some cases the 
grid adaption can be optimized by a temporal smoothing. The temporal grid 
time scale Tg is then related to a physical time scale typical for the problem, 
e.g., Tg Tdyn- However, to allow a readjustment of grid points the grid 
scale Tg must be larger than the smallest physical time scale relevant for the 
physical system. 

We can restrict the temporal variation by introducing a local description 
of the damping, e.g., like Tg = Tg{r) if many orders of magnitude have to 
be covered during the simulation. Due to the diffusive nature of the spa- 
tial smoothing in Eq. (4.9) we have formally an infinite propagation speed. 
Any new feature developing somewhere in the flow causes all grid points to 
move towards this phenomenon and increase there the local resolution. It 
is therefore possible that small motions near the center can result in rather 
large motions at the outer boundary, in particular for the case of a logarith- 
mic grid as defined through oc x. Accordingly, the variation of u*"®* scales 
in the same manner, yielding large advection flows without any transport 
of physical material just by the displacement of grid points on the physical 
background. Such situations can partially be cured through the aforemen- 
tioned scaling with Tg = Tg{r). Nevertheless, a better handling of these rapid 
grid motions remains to be understood. 

4.7 Remarks on the Grid Equation 

The grid equation is always solved simultaneously with the physical equa- 
tions, so the location of an individual grid point is not known a priori but 
only after a successful solution of the coupled system. At each time step a new 
mapping between the physical space and the space of indices is constructed. 

We have outlined the basic properties of an adaptive grid but each spe- 
cific problem requires the appropriate definitions of the point concentration 
n, the resolution TZ together with the physical quantities to be traced (physi- 
cal variable, scaling and weighting). In addition we have to specify the spatial 
tolerance by 2 and the temporal variation through Tg related to typical 
time scales. Although the number of the parameters entering the grid equa- 
tion seems numerous their action is well defined and will be demonstrated in 
the subsequent simple problems for which also analytical solutions exist. 
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4.8 First Example: Simple Test Function 

To begin with, it is quite useful to see how a single test function like 

/(^) “ ^ ^2inh(a(x - b))] exp [-((a: - b)/c)^] (4.18) 

is resolved on the adaptive grid. The function contains a steep gradient de- 
fined through the hyperbolic tangent with a scale length a~^ 10~^ as 

well as a smooth variation contributed by a Gaussian term c = 0.2 both 
centered around b = 0.4 on the unit interval [0, 1]. Such structures are fre- 
quently encountered in RHD computations since the thickness of shock waves 
is usually set by the artificial viscosity length scale (3.35) to 10”^ of the lo- 
cal radius. For the purpose of demonstration f{x) is normalized to unity as 
seen in Fig. 4.1a. Therefore, further scaling is not necessary and the adap- 
tive grid with N = 100 points clusters near the steep gradients. In Fig. 4.1b 
the function (4.18) is plotted against the grid index i to illustr^-te how the 
spatial smoothing spreads the gradients within the grid. The ratio of the 
point concentration (4.1) has to be bound between Oigl{ag 4- 1) = 2/3 and 
(a^ + 1)1 OLg — 3/2 as shown by the two dotted lines in Fig. 4.1c. The reso- 
lution achievable can be inferred from Fig. 4. Id where the dashed line marks 
the resolution on a equidistant grid with Ui — 10^. The peak resolution cor- 
responds to an equidistant grid with about 15000 points. 






Fig. 4,1, A simple test function f[x) in the physical space x in panel a) and on 
the adaptive grid as a function of the index i in panel b); the ratio of the point 
concentration ni/rii+i as well as Ui illustrate the grid distribution used to resolve 
the steep gradients 
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At the boundaries the conditions avoid gradients in the point concentra- 
tion and the values of agree on both sides indicating that the total number 
of grid points is sufRcient to resolve all structures. A rough estimate on the 
number of grid points necessary to cover m orders of magnitude is given 
through 



In(lO) 

"”ln(l + lK) 



2.3 , 



(4.19) 



since the grid can concentrate at most by the factor 1 + 1/ag per computa- 
tional cell. 

The adaptive grid presented in Fig. 4.1 is obtained by relaxation of the 
initially equidistant points to their final positions and when f{x) is given 
by an analytical expression like (4.18) only the solution of a pentadiagonal 
matrix is involved during the iteration. More complicated grid distributions 
for starting RHD computations are discussed in Sect. 4.10. 



4.9 Second Example: Shock Tube Problem 

Since the review by Sod (1978) on numerical methods for pure hydrodynamics 
the so-called shock tube problem is used to illustrate the properties of new 
numerical techniques. The initial conditions are simple but discontinuous and 
for early phase analytical solutions are obtainable from Riemann’s theory. All 
variables are normalized and defined on the unit interval [0, 1] where at x = 
0.5 an initial discontinuity separates a hot, high density gas (p = 1, F = 1) 
from a cold low-pressure medium (p = 0.125, F = 0.1). The gas is initially at 
rest and satisfies an equation of state of a perfect gas with 7 = 7/5. At both 
ends a reflecting boundary is imposed yielding a number of nonlinear waves 
such as contact discontinuities, shock reflections, shock merging, reflection of 
a rarefaction wave, as well as the interaction of a rarefaction with a shock 
wave. 

Since all variables are already normalized and the problem is stated in 
plane geometry we can take a very simple choice. However, the performance 
of the grid depends on the value of the temporal smoothing Tg since we have 
to deal with reflections of waves at the boundaries. Taking N = 100 grid 
points we expect a mean spacing of Ax ~ 10“^ and peak velocities around 
u 1. From this two numbers we estimate a typical flow time across one cells 
of r 10“^ and therefore as a first guess we can take a = 10“^, i.e., ten 

times smaller than our physical time scale r to allow an almost freely moving 
grid. As demonstrated in Figs. 4.4 and 4.5 the choice of Tg is not so critical as 
long as Tg is shorter than the physical time scales. For clarity the “standard” 
grid parameter for the shock tube problem are repeated as follows 

fi = p , Fi = 1 , wi = 1 , 

/2 = e , F 2 = 1 , W 2 = 1 , 

A'i = 1 , ag =2, Tg = 10-3 
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Fig. 4.2. Shock tube problem at t = 0.23 calculated with N = 300, donor-cell 
advection and Tg = 10~^ 



The physical solution consists of two nonlinear waves travelling to the 
right and a rarefaction wave propagating to the left away from the initial 
discontinuity at x = 0.5. Figure 4.2 plots the variables [a) density, b) veloc- 
ity, c) pressure and d) internal specific energy] at ^ = 0.23 before the first 
interaction at the reflecting boundary and the three different waves can be 
clearly distinguished. Going from left to right we see the rarefaction wave, 
followed by a contact discontinuity and the shock wave at the rightmost posi- 
tion. Although this solution has been calculated with a very simple first-order 
donor-cell scheme (cf., Sect. 3.9) the adaptive grid is almost “sitting” on the 
structures keeping all advection errors small. The length scale of the arti- 
ficial viscosity is 10“^ and we find a shock thickness of this order without 
any oscillations. Due to the crude nature of our difference scheme the only 
deviations from the analytical solution occur at the edges of the rarefaction 
wave. However, despite of this nice-looking solution any high order advection 
scheme suitable for an adaptive grid is preferable since the numerical short- 
comings of the discretization scheme cannot be compensated by an adaptive 
grid, in particular for higher dimensions where the number of cells is more 
limited than in these simple problems. 

In Fig. 4.3 the paths of individual grid points are plotted and the develop- 
ment of the three waves can be followed by the increased density of the lines 
where the expansion wave is less pronounced than the remaining features. 
At t = 0,29 we see the shock reflection occurring at the right boundary and 
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Fig. 4.3. The adaptive grid for = 100 points in the shock tube problem with 
dig = 2 and Tg = 10“^. Every second grid point is plotted and the different waves 
appear as contour line concentrations. The horizontal features are caused by the 
shock reflection at the right boundary {t — 0.29) and the interax:tion of the shock 
wave with the contact discontinuity around t = 0.41 (see text for more details) 



during this time the shock becomes invisible for the grid so all points move 
away to resolve the remaining features. As soon as the shock leaves the wall 
the grid rushes back to pick up the variations in density and energy as forced 
by the desired resolution. The same situation happens at t = 0.41 where the 
shock wave meets the contact discontinuity and suddenly two structures can 
be resolved on the same fine grid. Hence, the points start to spread out and 
are pulled in again as the two waves separate later in time. This grid mo- 
tion is clearly noticeable as the horizontal features in the (x,i)-plot and their 
time scale is controlled by Tg. As stated before at such points the quality 
of the advection scheme determines the overall errors. In principle, it should 
be possible to avoid such rapid grid motions around the interaction of dis- 
continuities. Up to now only the paper of Winkler et al. (1985) dealt with a 
so-called asymmetric time-filtering in the context of their complex formula- 
tion of another adaptive grid technique where the grid motion posses some 
“memory”. Although the grid motion is disturbed by these interaction re- 
gions we have not encountered any problems during these phases. The whole 
evolution requires about 100 time steps and due to the implicit nature of 
the numerical technique the typical Courant-numbers (Eq. (3.10)) are in the 
range of Nqfl — 10^-10^. 
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Fig. 4.4. Same (a;,t)-plot as Fig. 4.3 but = 10 ^ to demonstrate the robustness 
of the grid equation (see text for more details) 



To study the effect of the temporal smoothing the grid parameter is in- 
creased to Tg = 10“^, the order of our typical interaction time scale. The 
corresponding (x, ^)-plot of the moving grid points is presented in Fig. 4.4, 
where we see how Tg controls the grid point motion during these interaction 
time scales around t = 0.29 and t = 0.41. The grid points can still trace 
the wave propagation but looking closely to the wave interaction we observe 
a small deviation in the propagation speed before and after the interaction, 
in particular for the contact discontinuity. This is caused by the grid points 
which cannot come back fast enough to catch the steep density and energy 
gradients. The larger value of Tg mainly affects the long term evolution of 
the contact discontinuity since the shock wave can steepen itself depending 
on the length scaled defined by the artificial viscosity. 

Figure 4.5 illustrates even more dramatically the action of temporal damp- 
ing with Tg = 10“^. This value is clearly too large for the shock tube problem 
and the space-time diagram reveals that the grid point motion can only partly 
follow the waves. Nevertheless, the numerical solution as plotted in Fig. 4.6 
shows some basic properties of the physical solution but the discontinuity as 
well as the shock wave are strongly smoothed and most of the grid points 
are still around the initial discontinuity at x = 0.5. At time t = 0.29 we can 
barely see the shock reflection at the right boundary. 

As seen from the solution in Fig. 4.6 the coarseness of the grid leads to 
some small post-shock oscillation because the length scale of the artificial 
viscosity of 10“^ is too small to broaden the shock sufficiently for typical 
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Fig. 4.5. Same (x,i)-plot as Fig. 4.3 and Fig. 4.4 but = 10 ^ being too large 
for the problem. The points are not tracing the features although the numerical 
solution plotted in Fig. 4.6 still resembles the physical solution 



cell sizes. In cases with smaller temporal parameters the thickness of the 
shock is determined by the viscosity length scale and the grid points are free 
to cluster as dense as needed to resolve this jump. 

From the previous figures and the discussion we can conclude that Tg must 
be as small as possible to allow an optimal grid distribution and grid motion. 
Plotting the numerical solution obtained on an adaptive grid, it becomes 
evident whether the grid points have been able to follow the structures. In 
most cases the computations run smoothly without any time constant, i.e., 
Tg = 0, in particular for all kinds of expanding flows. The choice for the 
grid parameter is usually rather easy and values oi gi = I are in almost all 
cases sufficient to resolve the structures. In the case of large differences or 
linear versus logarithmic variations of the physical variables the smaller limits 
can be more weighted than the broad variations. Usually, all weightings axe 
between 1 < < 5 for normalized variables. Although the adaptive grid is 

quite flexible one should avoid to load too many distribution functions. Prom 
numerical experience we can state that in general two or three functions 
such as density, energy and/or opacity are enough to obtain a spatially well 
resolved computation. However, collapse calculations like the formation of 
protostars are more complicated to perform. This will be discussed in some 
detail in Sect. 6.3. 





4. Adaptive Grids 303 




0,0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 

Distance Distance 



Fig. 4.6. Shock tube problem at t = 0.20 calculated with N = 100 grid points, 
donor-cell advection and an adaptive grid with a time constant Tg = 10~^ yielding 
a non-optimal mesh distribution 

4.10 Initial Grid Distributions 

Using an adaptive grid, we need an initial grid distribution which is already 
a solution of the grid equation (4.12). Hence the physical variable have to be 
known at the locations of the grid points which are determined by an implicit 
description and whose positions depend on the physical variables themselves. 
In particular, the construction of appropriate hydrostatic initial models suited 
for our adaptive grid has caused problems (cf., Dorfi & Gautschy 1989) and 
requires some care. The basic numerical strategy consists of three steps and 
the initial grid distribution is created by a pseudo-time evolution. First, an 
initial physical model is constructed either by taking an analytical solution or 
by integration of a simplified system of time-independent ordinary differen- 
tial equations. Second, the positions at which the initial model is evaluated 
are, e.g., equidistant or come from a self-adjusting integration step size of 
the integrator used for the ordinary differential equations. In general these 
point locations are not in agreement with the demands imposed by the de- 
sired resolution of our adaptive grid. Therefore we can keep the initial phys- 
ical structure fixed in space and apply a linear interpolation to estimate the 
physical values at the positions in between the initial points if the points are 
sufficiently dense spaced. To summarize this step we need all M functions 
/jW. 1 < j < M entering our desired resolution (Eq. (4.5)) to be given at 
all locations x of our computational domain. Third, we start the pseudo-time 
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evolution and let grid points move to their final positions by solving a penta- 
diagonal matrix. Due to the fact that fi — f{xi) are fixed at all points xt 
during the pseudo-time evolution, the block-pentadiagonal matrix is reduced 
during this relaxation to a much simpler penta-diagonal matrix necessary to 
iterate the nonlinear grid equation. Note that the time constant Tg applied in 
this pseudo-time evolution is not related to any physical scale or to a value 
of Tg used in the subsequent dynamical calculations. 

As an example let us consider the initialization of a stellar model. Starting 
with a hydrostatic initial model the full set of RHD equations is reduced to the 
static case. For this purpose all time derivatives as well as the gas velocity are 
set to zero leading to the following system of ordinary differential equations 



(ODEs) 
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Augmented by the appropriate constitutive relations the static RHD sys- 
tem is integrated numerically using a standard integrator for ODEs. Such 
an integration has to begin at the photosphere where total mass, effective 
temperature and luminosity are initially specified for the mass equation, the 
radiation momentum equation and the radiation energy equation. An esti- 
mate of photospheric gas pressure provides the initial value for the hydrostatic 
equilibrium and is given by the conditions of a grey atmosphere. This esti- 
mate has to be determined iteratively until a given outer radius is reached. 
At this point the boundary condition for the radiation field yields an external 
temperature. A comparison between the computed and the desired external 
temperature then improves the new value of photospheric pressure and so 
on. Normally this iteration converges within less than 20 steps. If a variable 
Eddington factor / has to be used an additional iteration becomes necessary 
where / = 1/3 can be taken for an initial guess. The resulting structure is 
improved by a new /-profile obtained by solving the static transfer equa- 
tion (cf., Sect. 3.16). The spatial derivative of / in Eq. (4.22) is computed 
by means of a one- dimensional rational spline interpolation. After 3-4 itera- 
tions the structure of the atmosphere remains unchanged within an relative 
accuracy of 10“^. The solution of the static radiation transfer equation is 
also needed to specify the outer boundary conditions for the radiation field 
through J = where p, reflects the angular intensity distribution of the 

radiation field. Applying the Eddington approximation one obtains /2 = 0.5 
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Fig. 4.7. The grid relaxation towards an initiaJ RR Lyrae model. Every 4th grid 
point is plotted as a function of the number of time steps (see text for more details) 



when neglecting a stellar atmosphere. In the case of extended atmospheres 
/2 = 1 is the limit for free streaming of photons. 

The subsequent relaxation of the grid points on the computed initial con- 
ditions is clarified by inspecting Fig. 4.8, where the position of every forth 
grid points is plotted during such a relaxation procedure. After integration 
of a hydrostatic RR Lyrae model we determine the physical variables to be 
tracked during the subsequent computations which in this case are the den- 
sity by log(/>), the internal energy by log(e), the opacity by log(«:) as well 
as the linear function of the thermodynamic index Vad- According to our 
notation we have M = 4 and we must also specify the weighting for these 
quantities. We take ^ 1 , 2,3 = 1 and since Vad has a small absolute variation 
(however, these variations are very important to calculate the driving zones), 
we take = 5. The relaxation procedure is initiated by a small time step, 
i.e., St Tg and we use Tg = I which determines the pseudo-time. At the 
beginning, the time step adjusts to a value where the nonlinear iteration is 
still converging and will increase to values when all grid points have 

reached their final position. In Fig. 4.7 this grid motion is illustrated by plot- 
ting the radial position of every 4th point out oi N = 200 as a function of 
the number of time steps and we see how the points cluster around the steep 
gradients occurring around the ionization zone. 

The typical point concentration reaches at the ionization zone values of 
n = 10^ meaning that the relative resolution is 10~^ of the local radius. 
This grid feature is depicted in Fig. 4.8, where only the temporal evolution 
of the radial section is plotted around the ionization zone. According to the 
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radius [10^^ cm] 

Fig. 4.8. The grid relaxation towards an initial RR Lyrae model around the ion- 
ization zone. Note the change in scale at the x-axis. Every 4th grid point is plotted 
as a function of the pseudo-time (see text for more details) 



variation of the pseudo-time on the y-axis for t < 1 the rapid grid motion 
towards the ‘Important” structures becomes clearly visible whereas at t > 1 
the grid point paths begin to steepen indicating that the grid points are 
approaching their final locations. This division must happen around t Tg 
since for time steps St > Tg the grid becomes less and less influenced by the old 
time level. This plot nicely illustrates the grid motion without any crossing of 
grid lines as enforced by the derivation of the grid equation. Applying such 
a relaxation procedure we can always create initial grid distributions from 
equidistant mesh points. 

The construction of an initial grid leads to some small interpolation errors 
in the physical variables as explained by the following example. If we take 
the mass conservation in spherical symmetry we recall the simple differential 
equation (4.20) 

^ = 47rpr^ , (4.24) 

which can be integrated with any integrator for ordinary differential equa- 
tions. However, the discrete integrated version of this equation reads in our 
staggered mesh notation at the grid point ri 

47T / S \ / \ 

mi - mi^i = —pi [ri ~ . (4.25) 

Both versions agree to first order but the integrator solving Eq. (4.20) usually 
adopts some internal representation of the differential operator like, e.g., a 
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Runge-Kutta integration. It is evident that the mass integration based on 
Eq. (4.20) and our discrete finite volume version (4.25) can disagree by a 
small fraction which depends on the number of grid points, on the accuracy 
of the integrator as well as on the location of our grid points. For stellar 
initial models this total mass difference between the two versions is typically 
less than 10“^ and must decrease if N the total number of grid points used 
for the dynamical computations is increased. This discrepancy between the 
system of ordinary differential equations and the time-independent limit of 
the discrete RHD-equations is unavoidable. 

To summarize the above discussion which focused on the mass equation: 
We have to be careful when constructing hydrostatic initial models since the 
hydrostatic equilibrium is determined by a particular numerical representa- 
tion of the ODE-solver. In order to start with an initial model being strictly in 
hydrostatic equilibrium (as defined through our discrete set of equations) we 
therefore begin our computations by applying the full set of time-dependent 
RHD-equations on the initial niodels but take a large time step. This proce- 
dure can be used to check the quality of the initial model because if the model 
is stable and close enough to equilibrium the time step increases immediately 
and all remaining small velocities are damped out. On the other hand, if the 
initial model created is not well represented on the adaptive grid the time 
step remains small without reaching the time-asymptotic solution. 



5. Further Computational Needs 

5.1 Rational Spline Interpolation 

Since our numerical method uses an implicit discretization (cf., Sect. 3.4) 
we obtain a nonlinear system of algebraic equations that must be solved by 
some iteration. This is discussed in Sect, 5.3. During such an iteration not 
only the actual physical variables but also all functions like the EOS, opac- 
ity, Eddington factor have to be known, i.e., P = P{p, e), T = T{p, e), etc. In 
particular, for any iteration scheme based on a Newton-Raphson procedure 
also the derivatives of these quantities with respect to the unknowns must be 
provided and have to be smooth to allow for a rapid convergence. In the case 
of analytic expressions this task can be simple but for most “real” astrophys- 
ical problems the needed relations are given as tables which have then to be 
interpolated together with smooth derivatives. Due to the usual spareness of 
the data points special care is necessary to avoid spurious oscillations around 
steep gradients. 

Prom experiences with a large number of numerical computations with 
tables, I can strongly recommend the so-called rational splines as discussed 
in Spath (1983). In the case of 2-dimensional interpolation of a function T 
given at locations X{ and the values in between are approximated by the 
cartesian product of two rational base functions g{x) and g{y) , i.e., 
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In principle the weighting q can be different at every interval and there is 
no general adopted value for this parameter. Usually, one has to try several 
values of g, normally around q = 1 and one has to look at the variations of 
T but also at the smoothness of the derivatives dT jdx and dTdyy which are 
needed during the iterations. At each grid point {xiyyj)^ 16 values of aijki 
have to be calculated in advance and stored for each function to be used 
by the computation. Some hints for optimizing this interpolation are given 
in Sect. 5.2. Note, that g = 0 yields the standard cubic splines. A typical 
example of 1-dimensional interpolation through rational splines is illustrated 
by the cooling curve (Fig. 6.4) used in the SNR computations. 



5.2 CPU-Time Requirements 

The CPU-time necessary clearly depends on the problem and so I want to 
make only some general remarks for the present implicit method. The main 
CPU-time consuming operations are: 

- the opacity and EOS and their derivatives with respect to the physical 
variables X either by interpolation or through direct calculations, 

- the Matrix inversion which scales like N when M denotes the number 
of unknowns per grid point and N is the total number of grid points, 

- the overall Newton- Raphs on iteration which takes typically a few iterations 
to converge, 

- the total number of time steps to cover the long term evolution of the 
astrophysical system but the time steps are usually much larger than the 
CFL-time steps of explicit computations, 

- and in cases where the Eddington factor has to be obtained from the ra- 
diative transfer equation the total CPU-time is usually doubled. 

Nevertheless, most 1-dimensional problems can be solved these days on 
fast workstations with CPU times ranging from minutes in the case of SNRs 
(Sect. 6.1), up to hours or days in the case of long-term pulsational calcula- 
tions (Sect. 6.2) or dust driven winds (Sect. 6.4) and weeks for protostellar 
collapse calculations (Sect. 6.3), where also a convective energy transport has 
to be included. 
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5.3 Iteration Procedure and Matrix Inversion 



By denoting all unknown variables at grid index I by 






(«) \ _ j(n) (n) 






} TV 
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(5.2) 



the system of the discretized nonlinear equations (Sect. 3.13) together with 
the boundary conditions (Sect. 3.11) can formally be written at all grid points 
ri 



GmiiXi)) = 0 , 1 < m < M , (5.3) 

which includes time-centering (Sect. 3,5) between the new time level and 
the old one Clearly, we have St = This nonlinear algebraic 

system (5.3) contains 



M 

N 



equations (# of unknowns per point) 
grid points 



I M X iV unknowns 



and they are given implicitly, i.e., they cannot be calculated by a more or 
less complicated extrapolation from the old time level. 

Such a nonlinear system is solved iteratively with a Newton-Raphson 
iteration, i.e., the system is expanded up to first order in a Taylor-series 
around the new estimate 

^(J^(i+1)) ^ = 0 , (5.4) 

where the Newton-iteration assumes that is already a solution of the 

nonlinear system, i.e., ^(X^*“^^^) = 0. The right part of this expansion (5.4) 
is used to calculate the correction 

SX = - X^^ (5.5) 

by inverting the linear system 

<5^ = -(H) (5.6) 

dg 

where is the (A^Mx ATM)- Jacobi matrix of the algebraic system (5.3). 
oX 

This procedure is repeated until 



min 

l,m 



]SXm,l\ 

m, /| VruJ 



< e 



(5.7) 



is reached for all variables and indices. We introduced r}m,i to specify a lower 
boundary for variables which can become zero, e.g., for the velocity a fraction 
of the sound velocity can be used to set an upper iteration accuracy, i.e., 
; = 10“^ Cs,/« In most cases the Newton-Raphson iteration converges in 
less than 5 cycles to an accuracy of e = 10~®. In principle, the Jacobi matrix 
is a (ATM x A'M)-matrix but most entries are zero due to the 5-point stencil 
adopted for the discretization scheme. 
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Since system (5.6) must be solved iteratively it can happen that no com 
vergence is found. Assuming no coding error in the vast number of derivatives 
to be calculated, either the inversion procedure can become ill-conditioned 
or the time step is too large yielding starting values for the iteration too 
far away from the solution at the old lime level, a case being discussed in 
Sect. 3.5. 



5.4 Structure of the Jacobi Matrix 



Since the nonlinear algebraic system (5.3) is generated by the discretization 
rules the corresponding (ArMxA/^M)-Jacobi matrix must reveal the discretiza- 
tion scheme having a very specific structure. The matrix is mostly empty since 
only the variables at the neighbouring grid points (/ — 2,/ — l,/,/-hl,/-l-2) enter 
the discrete equations. In particular, the matrix has a block pent a- diagonal 
form since a each grid index 1 < / < one gets rows of 5 (MxM)-submatrices 



dGmJ dQm,l dQm,l dQm,l dQm,l ^ 



\ 



and, e.g., one to of these (M x M)-submatrices looks like, 

/ dQi^i dQij dGi,i \ 



dG: 



TYl.l 



— 1 



dxx^i-i dx2,i-i dxs^i-i 

dG2,i dG2,i 9G2 ,i 

dxij-i dx2,i-i dx3j-x 



= Aij-i . 



(5.8) 



V : : : ••• / 

Adopting this last abbreviation, and recalling the notation (5.5) for 

the corrections SXt to be added to the values Xi 

6Xi = • • ) , (5.9) 



the system (5.4) can be written like 



Ai^i-2^Xi-2 + Aii^iSXi-i + Ai^iSXi -{- 

+ Ai,t+iSXi+i + Au+2SX,+2 = -e(x/‘^) , (5.10) 

revealing the penta-diagonal structure. Note that all A’s are (MxM)-matrices 
and all X’s are M- vectors. The inner boundary conditions I — N imply 

Ajv,N+i = ^yv,iv+2 = 0 , (5.11) 
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whereas the outer boundary I = 1 yields 

Ai,_i = Ai,o = 0 . (5.12) 

This special structure allows an easy inversion of the Jacobi matrix (5.6) by 
means of a recursion which leads to an upper triangular block system of the 
form 

SXi = Ui + Vi 5 Xi^2 + , (5.13) 

where after the forward substitution in (5.10) the following expressions ap- 
pear: 

Yi = ( [Ai^i-iUi-2 + Vi-2 + A,.,) , (5.14) 

Ui = Yi [ [Ai^i-2Ui-2 + Vi_i + , (5-15) 

Vi = ViA/,i+2, (5.16) 

wi = Yi [- — [A 1 J- 2 U 1-2 + Ai^i-i]wi-i-Ai^i-2'Wi-2] -{5.17) 

From the last expressions it becomes clear that such a triangular block form 
is obtained by inversion of the (MxM)-matrix (5.14) in each row of the ma- 
trix (5.8). Starting with I = 1 due to the aforementioned boundary conditions 
(5.12) one gets Yi = } and t/i, Vi and wi can be calculated. Proceeding 

until Z = is reached the boundary conditions (5.11) leads to Ui^f Vat = 0 
and V)v-i = 0 and SXfsf = wn follows immediately. Now the back substitu- 
tion in recursion (5.13) is performed to compute all necessary corrections SX 
to the physical variables X. These iterations are repeated until the desired 
accuracy (5.7) is achieved, or until it becomes evident that the iteration will 
diverge because the new solution is too far from the first estimate. 

The numerical inversion of large matrices can cause problems, in partic- 
ular when wide ranges in the variables are expected. As stated in the intro- 
duction almost all astrophysical objects cover many orders of magnitude and 
it is therefore necessary to normalize the equations before the inversion of 
the Jacobi matrix is performed. From numerical experience I can recommend 
the following normalization factors Gm,i for the discrete set of the six RHD 
equations (see Sect. 3.13.): 

with 

ym,i = {pi, /3/max(|ui|,Cs./), Piei, mi, Ji, max(|ff,|, J;) ). (5.19) 

Hence, at each grid point all RHD equations are divided by the “conserved” 
quantity at the old time level or when the velocity ui or the radiative flux 
Hi can become zero or negative by some physical limit. After such a nor- 
malization the inversion behaves more stable and the temporal variation of 
these “conserved” quantities (5.19) will be around one. The grid equation is 
already normalized through the physical scaling factors. 
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5.5 Time-Step Control 

After a successful iteration the relative variation between the two time levels 
is calculated at each grid point ri for all M variables {xij,X 2 ,iy ■ yXMj) 

Sm = maxSrn.i = — — • (5.20) 

These values Sm are used to determine the next time step St from the old 
time step 6t^°^ by the simple condition 

f if 5m > 

dt = , if (5.21) 

[ , if , 

where is the maximum allowed variation in the variable Xm- Typical 
values are = 10“ ^ As inferred from conditions (5.21) the next time 
step is reduced by a factor of 2 in case of larger variations but only increased 
by a factor of 1.5 in case of smaller variations than s^^^/2. Such a hysteresis 
in the time step control avoids oscillations in time step sizes. Of course, in 
case that the Newton-Raphson iteration diverges, the time step has to be 
decreased until convergence is achieved. 

The total number of iteration can be reduced usually by one if the starting 
value of the Newton-iteration (5.6) is obtained through an extrapolation 

from the last two time levels, i.e., 

^ , (5.22) 

where the ratio of the new estimated time step (5.21) to the old time step 
enters and denotes the solution before last. 



6. Computational Examples 

6.1 Evolution of Supernova Remnants (SNRs) 

The hydrodynamical evolution of Supernova Remnants (SNRs) may serve 
as an astrophysical example where a number of tests can be performed on 
the quality of the numerical computations. We have to deal with explosive 
motions with high Mach numbers and for later times the self-similarity of the 
pressure driven expansion allows a direct comparison with analytical solutions 
(Taylor 1950, Sedov 1954). At the late stages of the SNR evolution radiative 
cooling of the expanding shell becomes important and the high compression 
ratios occurring then in the dense shell demand locally a high resolution to 
follow their development. Finally, since SNRs are considered to be the most 
likely sources of galactic cosmic rays (e.g., Axford 1981) the contribution of 
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these high energy particles to the overall dynamics of the interstellar medium 
can only be determined by numerical computations. 

The computations presented are performed with N = 300 grid points 
distributed between rjsf = 10^^ cm and ri = 10^^ cm. The adaptive grid 
contains the density as well as the internal gas energy, the time smoothing 
Tg can be set to zero but the spatial scaling has to be logarithmic and the 
physical scales are defined through the harmonic mean to prevent the grid 
from “tearing” at the discontinuities, i.e., 



/l.i 


= Pi > 
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1/2, 
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= 1, 


/2,/ 


= ei , 




II 

-f 


1/2, 


V)2 


= 1, 






ag 


= 2, 




^9 


= 0. 



The energy of the supernova of Esn = 8 x 10^® erg is deposited purely 
as thermal energy within a radius of 10^^ cm. The density structure of the 
progenitor star is very simple; a constant density up to 10^^ cm and then 
an exponential decrease towards the external density of next = 5 cm”^. The 
external material is at rest and the pressure is given by the parameters of 
a rather warm interstellar medium of Text = 20000 K. The ejected mass 
Mej = 3 Mq fixes the mean density of this simple progenitor star but the 
detailed structure of the progenitor star is not important for the further 
evolution of the SNR. This particular example includes also the acceleration 
of energetic particles at shock waves as discussed later and the external cosmic 
ray pressure is set equal to the external gas pressure, i.e.. Pc, ext = Pg,ext* 
Note, that the models presented in the next figures are calculated including 
the acceleration of high energy particles at the shock waves. As outlined in 
Sect. 4.10, the initial grid distribution can be obtained from the pseudo- time 
evolution. 

At the beginning the evolution of the remnant is described by a simple 
ballistic expansion where the external medium has almost no influence on the 
motion of the ejected material. Nevertheless, the flow configuration contains 
several interesting features. First, the ejected material is separated from the 
interstellar medium by a contact discontinuity. In front of this discontinuity 
we expect a strong shock wave propagating through the external medium. 
Since the expansion works against the interstellar pressure the ejected ma- 
terial slows down and the so-called reverse shock decelerates the SN ejecta 
although the whole structure is moving outwards with more than 10^ km s”^. 

Figure 6.1 shows such a flow configuration at one year after the explosion. 
In order to show the resolution of the adaptive grid every grid point is plotted 
in range between 0.5 < r/[10^^ cm] < 1.4. The contact discontinuity is 
located almost at 10^^ cm seen by the drop in the density (Fig. 6.1a). In 
front of it the external shock wave becomes visible a the velocity jump of 
15 000 km s”^ (Fig. 6.1b) with an increase of the gas pressure by more than 
6 orders of magnitude (Fig. 6.1c), the corresponding Mach number is around 
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Fig. 6.1. A typical SNR shock configuration after one year, running into a constant 
interstellar medium 



M = 1000. Behind this forward shock the spherical geometry causes the 
sharp decrease of the density towards the contact discontinuity. Inside the 
exploded star we have a linearly increasing velocity field up to the point 
where the reverse shock decelerates the expanding material as depicted in 
Fig. 6.1b and the Mach number of the reverse shock is M = 140. In Fig. 6. Id 
the relative spacing log[r/zir] demonstrates the clustering of the grid points 
in the vicinity of steep gradients. 

This evolutionary stage lasts until the ejected material becomes compa- 
rable to the swept-up interstellar material, i.e., 




During the ballistic phase the supernova energy is purely kinetic energy 
and assuming a linear velocity field inside the ejecta we can define a typical 
ejection velocity through 



Uei — 



' IOF^sn 
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A typical sweep-up time scale is then given by 

. -^SW 

^SW 5 



(6.3) 





6. Computational Exaunples 



315 




Fig. 6.2. Space- time diagram of the grid points during a SNR-evolution with Esn — 
8 X 10^° erg exploding into a medium at rest with next = 0.3 cm“^ 

which corresponds to about 3750 years for an external density of next = 
0,3 cm"^ and a SN-energy of £?sn = 8 x 10^^ erg. After this age the reverse 
shock starts to propagate inwards and re-heats the interior to high tem- 
peratures followed by the pressure-driven expansion which is known as the 
Sedov-Taylor phase. In Fig. 6.2 the space-time evolution of the grid points is 
plotted between 1 and 10® years in the range 10^^ cm and 10^^ cm. Up to the 
sweep-up time tgw we see the ballistic expansion where all discontinuities are 
moving outwards together. The innermost line-concentration begins to move 
to the left and the horizontal line at \ogt = 3,57 marks the reverse shock 
running towards the interior. Since the temporal smoothing Tg is set to zero 
all grid points have to react to this motion immediately, in particular visi- 
ble through the paths of grid points located in the external medium. Inside 
the forward shock the remaining density structures are resolved by the grid 
although they have no dynamical influence on the overall expansion. During 
the expansion phase as well as during the Sedov phase the adaptive grid al- 
lows large time steps. Since only every third time step is plotted in Fig. 6.2 
some grid lines appear to have sharp corners. As also shown in Fig. 6.5 the 
evolution for times t > tcoo\ is strongly influenced by radiative cooling which 
causes a further rearrangement of the grid points after logt > 4.7. 

Figure 6.3 displays the location of the reverse and forward shock in a 
logarithmic diagram during the transition from the expansion phase to the 
pressure- driven Sedov phase. The estimate of from Eq. (6.3) is drawn as 

a vertical dashed line yielding almost an asymptotic line for the locations 
of the inward propagating reverse shock. During the following Sedov phase 
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log t [year] 

Fig. 6.3. Space-time diagram of the two SNR shock waves compared to a self- 
similar Sedov-solution (shifted straight line) 



the motion obeys a self-similar solution with Rs oc which is plotted for 
comparison as a shifted straight line. During such self-similar expansions the 
adaptive grid allows large time steps because the structures remain almost 
stationary on the moving mesh. 

As mentioned before, the late SNR phases are dominated by radiative 
cooling and a typical cooling curve is plotted in Fig. 6.4. The symbols above 
the curve denote the chemical elements mainly contributing to the features 
seen in the curve. In particular, below 10"^ K the radiative cooling becomes 
very ineffective and the huge drop in cooling efficiency near 10^ K works like 
a thermostat for Hll-regions. Around 3 x 10^ K another minimum is found 
because all elements are fully ionized and the plasma can therefore radiate 
only through free-free transitions (i.e., bremsstrahlung) leading to a cooling 
law of A{T) (X For many applications and analytical considerations 

Kahn’s approximation of the cooling function in the range between 2 x 10^ < 
T/[K] < 10^ (Kahn 1976) has been adopted which can be stated as a simple 
power law of the temperature 

A{T) = 1.3 X ergcm^s'^ (6.4) 

shown in Fig. 6.4 by the dashed straight line. Such a cooling curve is based 
on equilibrium conditions which are not fulfilled in many cases for a shock- 
heated gas. Nevertheless, as a first approximations this energy loss term can 
be used during the late stages of a SNR evolution. I depict this curve also 
to urge the use of a smooth interpolation scheme (see Sect. 5.1) since any 
further (numerical) oscillations will significantly reduce the convergence rate 
in the Newton-Raphson iteration. 
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Fig. 6.4. The radiative cooling curve A{T) plotted with three analytical approxi- 
mations (dashed and dotted lines) used in the literature 



The cooling phase in a SNR is characterized by the formation of a dense 
shell behind the compressed interstellar medium. Assuming, e.g., Kahn’s cool- 
ing law (6.4) the time and the radius where such a shell is formed can be 
estimated by (see, e.g., McKee 1982) 

iishell - 20 ^jq 51 gj.g^ pc (6.5) 

and no denotes the number density of the external medium in cm”^. Due 
to the adiabatic expansion the temperature of the SNR decreases during the 
Sedov phase. Below 10^ K the radiative cooling sets in triggered by the re- 
combination of the heaviest elements. In Fig. 6.5 four snapshots of the SNR 
with Esn = Sx 10^^ erg and no = 5 cm~^ leading to Rsheii = 9.7 pc are plot- 
ted at at 1.4 X 10^, 3.6 x lO'^, 6.1 x 10^ and 9.7 x 10^ years. The development 
of such a dense shell (Fig. 6.5a) is exhibited and in good agreement with the 
analytical estimate of Eq. (6.5), The shock structure reveals an important re- 
duction of the shock speed seen through the drop in the velocity (Fig. 6.5b). 
The leftmost curves correspond still to a Sedov-like structure as inferred from 
the small pressure variations inside the remnant (Fig. 6.5c). The second curve 
clearly exhibits the cooling region behind the radiative shock, where the post 
shock temperature can even drop below the up-stream temperature since 
the adiabatic expansion in the post shock region adds the radiative cooling 
(Fig. 6.5f). The very tenuous SNR interior is at least two orders of magnitude 
less dense than the post shock region and is therefore unable to cool since 
the radiative cooling scales with the particle density squared. As plotted in 
Fig. 6.5d the shock wave is accelerating energetic particles and this acceler- 
ation process depends on the overall compression ratio of the shock wave. 
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Fig. 6.5. Late SNR evolution dominated by radiative cooling (see text for more 
details) 



The accelerated particles increase the cosmic-ray pressure Pqr at the shock 
wave and this gyro-resonant acceleration process can be characterized in the 
hydrodynamical limit by a diffusive process (e.g., Drury 1983 for a compre- 
hensive review on particle acceleration). Mainly two effects modify the SNR 
evolution at these late phases. First, these high-energy particles can also dif- 
fuse into the up-stream region of the shock front generating a precursor as 
visible in Fig.6.5d where the incoming plaisma is already decelerated. Sec- 
ond, the additional particle pressure prevents the cooling region from further 
collapsing. Note the linear scale on the particle pressure plot 6.5d. At later 
times, the temperature of the post shock region decreases further and the 
cooling becomes less effective as can be seen in the thickening of the dense 
post-shock shell. The thickness of the cooling zone is determined by the effi- 
ciency of the cooling and the maximum of the cooling curve A{T) is around 
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10^ K as inferred from Fig. 6.4. Since this radiative cooling is mainly observ- 
able in X-rays the details of this cooling region are important for comparison 
with observations. Any resolution error will change the fluxes because the 
energy radiated away scales like p‘^A{T) and a poor resolution of the density 
structures influences the cooling and this post-shock pressure drop reacts im- 
mediately on the shock propagation. Hence, these later SNR phases depend 
strongly on the quality of the numerical scheme, in particular on the ability 
to trace very fine spatial structures. The radiative energy losses are depicted 
in Fig. 6.5, where the luminosity is plotted in units of 10^ L©* The emission 
is strongly peaked at the dense shell demanding a very fine resolution. 

6.2 Nonlinear Stellar Pulsations 

Since the first attempts to calculate nonlinear pulsation models it has become 
clear that the numerical representation of the driving zones is crucial for 
obtaining accurate results. The stellar body is divided into discrete shells 
and hence almost all computations have been performed on a Lagrangian grid 
which keeps the mass fixed to a computational cell (or grid index). Within 
the ionization zones the resolution varies during a pulsation cycle because 
the location of the ionizing and driving regions do not remain constant with 
respect to the Lagrangian mass coordinate (see, e.g., the proceedings edited 
by Buchler (1990) for a detailed discussion of these issues). Due to the vast 
number of observational tests a number of long standing problems based on 
the Lagrangian numerical tradition have been emerged, e.g., the numerical 
light curves are very “noisy”, the amplitudes do not agree with observed 
ones, the damping due to large values of the artificial viscosity influences 
the results, the shape of light curves differs in the Fourier coefficients (see 
Eq.(6.6)), etc. The majority of these effects can be solved with adaptive 
methods although better tables for the opacity had an enormous impact on 
stellar pulsation theory (e.g., reviews by Gautschy & Saio 1995,1996). 

The inner boundary of the pulsation models is located above the nuclear 
active region, usually around a gas pressure of log P ~ 12 where the amplitude 
of the oscillations is practically zero. The outer boundary we specify within 
the stellar atmosphere, where the gas pressure is several orders of magnitude 
lower than the photospheric value. This procedure avoids artificial driving of 
the stellar pulsation. We start from an initially hydrostatic configuration and 
disturb the model by a very small random velocity field. 

RR Lyrae stars are low mass (M ~ 0.5 M©) variable stars with a rela- 
tively low metal content as they belong to Population II stars, i.e., for the 
models discussed we used (X,Y,Z) = (0.7,0.299,0.001). These stars exhibit 
a narrow range of visual magnitudes and their periods range typically from 
about 0.2 days to 1.2 days. Depending on the amplitude, period, and the 
shape of the light curve, several different classes of pulsations have been in- 
troduced. The RRab subclass is characterized by a steep ascending branch 
with generally asymmetric light curves belonging to the fundamental mode 
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Fig- 6.6. RR Lyrae hydrostatic initial model 



pulsation, whereas the RRc subclass exhibits almost sinusoidal light curves 
generated by a first overtone pulsation. The typical range of luminosities 
peaks at about 60 Lq which makes these stars less suitable to estimate the 
distances of external galaxies. Hence, observational studies are restricted to 
our galaxy. Furthermore, it is well known that these stars undergo nonlinear 
pulsations in the fundamental mode, in the first overtone, or in both modes 
simultaneously. A large number of numerical computations have been carried 
out by various authors (e.g., Stellingwerf 1975, Kovacs & Buchler 1993, Bono 
& Stellingwerf 1995, Feuchtinger & Dorfi 1994, 1996, 1997) to explore the 
stability of RR Lyrae models. Since RR Lyrae stars are extensively studied 
objects from observational grounds as well as from theoretical modelling, any 
result based on a new development can be compared to a number of results 
available it the literature. 
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The outer radial structure of the hydrostatic initial model is plotted in 
Fig. 6.6 between 3.26 x 10^^ cm and 3.72 x 10^^ cm. Every grid point is shown 
individually to illustrate the good resolution of the physical quantities. In this 
case, the stellar parameters are given by M = 0.65 M©, L = 52.5 L©, and 
Teff = 6800 K. Focusing on the outer parts of the model, we clearly can 
identify the stellar atmosphere by the rapid spatial decrease of the density 
(Fig. 6.6a) and gas pressure (Fig. 6.6b). The temperature (Fig. 6.6c) remains 
constant within the atmosphere. Within this structure we see also the den- 
sity inversion due to the opacity (Fig. 6.6d) which varies over 5 orders of 
magnitude. Note, that these are purely radiative models where no convective 
energy transport is included. Below the stellar photosphere are located the 
driving zones as demonstrated through the variations of the thermodynamic 
quantity Vad = dlnT/dlnP, which clearly mirrors the stellar driving zones 
(Fig. 6.6e) and varies within the ionization zones (e.g., Kippenhahn & Weigert 
1990). It is therefore essential to distribute enough grid points in these re- 
gions to obtain an accurate description of the stellar pulsation.' At a radius 
of about 3.45 x 10^^ cm (logi^ = 0.54 [10^^ cm]) we find the He Il-ionization 
zone. At 3.618 x 10^^ cm (logi? — 0.56 [10^^ cm]) the H-ionization zone is 
superimposed on the broader He I-ionization zone. The quantity in Fig. 6.6f 
displays the point concentration and reflects the clustering of the grid points 
near prominent physical features. Since the stellar pulsation is driven by these 
variations it is important to resolve the gradients of the physical quantities 
during the pulsational cycle. 




Fig, 6.7. Calculated RR Lyrae light curve in bolometric magnitudes Mboi for ^ = 
0.578 Mq, L = 64.3 L© and Tefr = 6500 K (solid line) together with the Fourier 
fitted light curve shifted by 0.2 mag (dashed line) and the difference Mboi — Affit 
(thin solid line), shifted by 0.8 mag 
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I want to emphasize that inside the minimum of the Hell-ionization zone 
the computations use a Lagrangian grid to avoid the small advection errors 
always present in case of transport over cell boundaries. A more detailed dis- 
cussion on this Lagrangian switching point is found in Sect 7.1. The following 
computations are done with N = 200 grid points resolving the density, the 
internal energy as well as Vad, i e., 
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The Fourier decomposition of RR Lyrae light curves has been carried out 
by several authors to compare the light and radial velocity curves of nonlinear 
pulsation models with observational results (e.g., Simon & Teays 1982, Simon 
1985). The observed or theoretical light L{t) curves are least square fitted as 
a function of time t by 

K 

L{t) = Ao + cos {ku;{t - to) + #fc) . (6.6) 

k~l 

K is usually set to 8 and Ak,^k^ ^ smd to denote the Fourier amplitudes and 
phases, the frequency and an arbitrary zero point, respectively. The param- 
eters involved in Eq. (6.6) are determined from a nonlinear least square fit. 
From the Fourier amplitudes and the phases the following low order combi- 
nations are derived 

^21 = ^ 2 /Ai , ^ki ~ 5 (6.7) 

and then employed for the comparison between theory and observations. Since 
the values of R 21 as well as the Fourier phases are very sensitive to details of 
the shape and amplitude of the light curve, the theoretical models must be 
compared to observations in a more quantitative way (see also Sect. 7.1). 

Figure 6.8 plots a typical RR Lyrae light curve (M = 0.65 Mq, L = 
52.5 L0, Teff = 7000 K) resulting from the adaptive RHD-computations. 
Since the grid points located within the driving zones provide sufficient reso- 
lution the light curves exhibit a very smooth character usually not obtained 
with Lagrangian codes. As pointed out earlier, the ionization zones remain 
neither constant in mass nor in space and a varying number of grid points 
within such zones yields a varying driving force becoming visible in fluctua- 
tions of the light curves. However, the period of the oscillation is quite robust 
and therefore the difference in the oscillation frequency between adaptive and 
Lagrangian computation is usually small. 

On top of these RHD grey computations, a frequency dependent radia- 
tion transfer can be done to obtain the spectral information. The methods 
are described in Sect. 3.14. Besides the frequency-dependent opacities Ku, no 
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Fig. 6.8. Calculated RR Lyrae light curve in bolometric magnitudes for M = 
0.65 Mo, L = 52.5 L© and Teff = 7000 K together with the U,B,V colours. All 
curves are plotted around their mean vzdue (m) to show the difference in the colours 
over a pulsational cycle 



new ingredients are needed to perform such calculations. It is clear that the 
frequency-integrated means like, e.g*., the Rosseland mean «r used for the 
RHD computations, have to be obtained from the same frequency-dependent 
opacities Kp. The non-grey computations have to begin deep enough in the 
stellar interior for all frequencies of the radiation field to be still isotropic. 
Applying the theory of radiative transfer, the boundary condition of a black 
body with a temperature T = T{tn) and a Rosseland mean kr = «^R(ryv) 
can be specified at the inner boundary by 



, / \ M Skji H L 

dT 2aRT3 ’ “ 16^^ 



( 6 . 8 ) 



where /i is the angle between the outgoing ray and the radial direction (see 
Sects. 3.14 and 3.15). 

After the non-grey radiative transfer is computed, the theoretical spectra 
are folded with the spectral filter curves to obtain the standard U, B, V 
photometric colours as plotted in Fig. 6.8. All curves are normalized to their 
mean values over a period so that the differences during a pulsational cycle 
become evident. 

Hydrogen deficient carbon stars (HdC) are characterized by a peculiar 
chemical composition and they consist mainly of helium {Y ~ 0.9) and car- 
bon (Zc — 0.07) with very low hydrogen contributions (X (i:: lO""^). Some 
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Fig. 6.9. A nonlinear stellar pulsation of a hydrogen deficient carbon star initiated 
by a small random velocity fluctuations with u — .Olcg. a) The stellar luminosity 
in units of lO'^L©. b) The photospheric velocity in kms”^ c) The radius of the 
photosphere in units of Rq 



of these HdC are also known as R CrB stars, where the large luminosity 
variations can be explained by obscuring of newly formed dust particles in 
the expanding carbon-rich circumstellar envelope. Most of these HdC stars 
exhibit radial pulsations with large amplitudes in the atmospheric velocities 
but small changes in the luminosities. The origin of the HdC stars is still 
a controversial issue since the apparent absence of hydrogen is difficult to 
explain. Theories of last helium flash scenarios and white dwarf merging are 
discussed in the literature (e.g., Paczynski 1971, Schonberner 1986, Iben et 
al. 1995). 

HdC-stars can exhibit a rapid growth of nonlinear pulsations due to 
so-called strange modes. Figure 6.9 shows an example with M = IM©, 
L — 10^ L© and Teff = 6000 K. The nonlinear pulsation manifests itself 
quickly as a high-overtone mode with small luminosity and velocity varia- 
tions with a period of 10.5 days. The photospheric velocity oscillates between 
±4kms~^, the relative radius variations are AR/R = 0.014 and the scale of 
the bolometric luminosity is given by ZlMboi = 0.2 mag. 

Due to the increased resolution available by the adaptive-grid method the 
nonlinear pulsations can be followed in much greater detail over a long time 
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Fig. 6.10. LBV Pulsation showing a transition into another mode 



interval than hitherto. Since the structures remain almost stationary on the 
adaptive grid the numerical time steps increase and the computations can 
be performed on workstations. In order to compare these results with ear- 
lier investigations we have only presented pulsation calculations with the old 
Los Alamos opacities. However, it has become evident that the details of the 
light variations during the pulsation depend not only on the opacities used 
but at the same level on the exact treatment of the radiative transfer in the 
outer layers. All models presented here are calculated with the inclusion of 
a stellar atmosphere to obtain the right boundary conditions in the case of 
outwards propagating waves. The pulsations of HdC stars are highly nonadi- 
abatic leading typically to large velocity and small luminosity variations. 

Luminous blue variables (LBV) are among the most luminous stars shin- 
ing with luminosities up to 10® L© having masses up to around 100 M©. 
Due to their high luminosities and variability they are of great interest for 
theoretical and observational studies, in particular as possible steps in the 
extragalactic distant ladder. As encountered in many stars with a high lu- 
minosity to mass ratio, strong non-adiabatic pulsations are excited (see, 
e.g., Gautschy & Saio 1995, 1996). Figure 6.10 depicts an example with 
M = 90 M©, L = 1.32 X 10® L©, Teff = 35 900 K and = 29.9 i^©. Ac- 
cording to a linear stability analysis this star is unstable in the 2 “^-overtone 
with a period of P 2 ~ 0.88 days. Our radiation-hydrodynamic simulations 
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are exited with random fluctuations in the velocity field with 0.1 Cg. The 
pulsation evolves into an oscillation pattern which differs from the linear pre- 
diction as can be inferred from the long-term behaviour plotted in Fig. 6.10. 
Part of the kinetic energy is transferred into the stellar atmosphere which 
thereupon expands. As a consequence, the pulsation build up about a differ- 
ent equilibrium radius at 1.18/2* and = 33000 which leads to 

an increased pulsation period of — 1.13 days. This scaling agrees with 
P (X p Note, that such behaviour can only be investigated by nonlinear 
computations where a linearization around the initial equilibrium model is 
not assumed. In the case of more unstable modes, a more complicated light 
curve is expected which possibly leads to an explanation of the observed LBV 
eruptions. 

6.3 Protostellar Collapse 

Protostellar collapse calculations are a long standing problem for computa- 
tional RHD. Beginning with the work of Larson (1968) it became evident 
that due to an accretion shock around a hydrostatic core very complex flow 
structures are expected and that the energy generation in the shock fronts is 
the main source of the emerging radiation. The initial conditions are taken 
to be a Jeans-unstable interstellar cloud embedded in an external isotropic 
radiation field, typically with Text = 100 K. The collapse proceeds in several 
phases where a number of different physical effects modify the flow struc- 
ture. First, at the beginning the collapsing gas remains isothermal since the 
optical depth is so small that all compressive energy gains can be radiated 
away. As the material becomes denser the energy gets trapped, the gas is 
heated up and around 2 000K molecular hydrogen starts to dissociate (see, 
e.g., the decrease of Pi in Fig. 2.2) and reduces the gas pressure which trig- 
gers the so-called second collapse. Second, a hydrostatic core is formed which 
will contract on its thermal time scale during the further evolution. This core 
is now surrounded by a strong radiating shock wave which decelerates the 
infalling material and allows a gravitational settling on the growing central 
core. Third, the large temperature and density variations yield large opac- 
ity changes within the flow which necessitate a calculation of the Eddington 
factor. Having a very isotropic radiation field near the center the radiation 
is dominated further out by the accretion shock and finally becomes again 
isotropized very far out by the dust particles embedded in the gas. Note that 
this accretion of material is time-dependent and the shock front will vary 
in strength and position. The overall evolution occurs in an extremely non- 
homologous way. All these complications discussed in the previous sections 
impose very strict conditions on the numerical treatment since many orders 
of magnitude in space and time have to be covered. Actually, these problems 
led to the first developments of adaptive grids (Winkler 1975). 
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The protostellar collapse calculation (G, Wuchterl, private communica- 
tion) presented is performed with N = 1000 grid points and the following set 
of grid parameters 
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To provide a smoother variation in time it can be advantageous to set, 
e.g., Tg = 10“^iff,i, where the local free-fall time up to the radius ri has been 
introduced: 

/ Stt . , _ Smi . . 

'"•' = V320A " = 

The grid is determined by the variations in the density and the internal 
gas energy. Due to the large dynamical range all variables are scaled by the 
harmonic mean. The temporal smoothing is done locally by allowing the grid 
to move at maximum only at a small fraction of the local free-fall time scale. 

The first full hydrodynamical calculation of a radiating collapse flow in- 
cluding realistic opacities and an appropriate EOS have been performed by 
Winkler and Newman (1980ab). A typical snapshot of the flow structure at 
3.8 X 10^ years is depicted in Fig. 6.11 between 10^ cm and 10^^ cm. The 
density (Fig. 6.11a) reveals a free falling envelope with p oc an accre- 

tion shock at logr = 11.85 and a hydrostatic core inside. The scale in the 
density at this stage is about 20 orders of magnitude. The velocity struc- 
ture of Fig. 6.11b clearly shows how the infalling material is decelerated at 
the shock front from 25kms“^ to a small fraction of the sound velocity. To- 
gether with the luminosity in units of Lq plotted in Fig. 6. lie it becomes 
clear that almost all radiation is generated in the accretion shock and only 
slightly modulated by the infalling plasma. Figure 6.11c shows the variation 
of the gas pressure ranging over almost 30 orders of magnitude. The tem- 
perature profile exhibits several interesting features seen in Fig. 6. lid. The 
modulations in the collapsing material are causes by changes in the opacity 
(Fig. 6. Ilf). Around 300 K the dust particles loose their icy mantles and at 
1800 K the remaining core of the dust particles get evaporated leading to the 
bumps in the opacity. The radiating shock front appears as a sharp peak in 
the temperature which is also accompanied by large variations of the opacity. 
Going further inside the accretion shock is accompanied by an increase of the 
temperature due to a slow contraction of the protostellar core. Note that the 
temperature inversion within the central parts of the hydrostatic core is still 
revealing the variations in the accretion history. At earlier times the material 
has been falling with lower velocities on the smaller and less compact central 
mass and therefore the compressive heating has been less effective. Hence, 
during the contraction towards the main sequence this temperature inversion 
has to vanish by heating from the outer hotter layers. Since the primordial 
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Fig. 6.11. Protostellar collapse of a 1 Mq fragment at t = 3.8 x 10^ years (see text 
for more details). Courtesy of G. Wuchterl 



deuterium is the first material to start a nuclear energy generation the onset 
of nuclear burning will not occur in the center. 

To illustrate the resolving power of the adaptive grid, the vicinity of the 
accretion shock is depicted in more detail in Fig. 6.12, where except for the 
luminosity the same variables are plotted as in Fig. 6.11. The radius is given 
in units of the shock radius Rs = 10.17 i?0 and one tick mark on the x- 
axis corresponds to {Ar)lr = 10“^. However, the grid clustering leads near 
the shock wave to almost {Ar)/r = 10“^ as can be inferred from Fig. 6.12f. 
The incoming gas with a temperature of Ti — 1620 K is heated up in the 
radiative shock wave to T 2 ~ 6700 K and cools down again in the post region 
to a minimum temperature of T3 = 1680K at R/Rs = 0.9899 which cannot 
be shown on the scale of the plot. Hence, this structure is described as a 
supercritical shock wave with Ti c::^ T3 T2 (Zeldovich & Raizer 1969). As 
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Fig. 6,12. Structure of the accretion shock front at Rs = 7.08 x 10^^ cm (see text 
for more details). Courtesy of G. Wuchterl 



mentioned in the introduction (Sect. 1.3) the length scale of such a shock is 
of the order of the mean free path Ip of the photons. The stability of this 
accretion fronts has been investigated by Balluch (1988). He demonstrated 
that using an adaptive grid permits the artificial viscosity length scale to be 
reduced to values smaller than Ip. 

6.4 Dust-Driven Winds 

The next example deals with a stellar outflow produced by the radiation 
pressure on newly formed dust grains. The outer layers of extended, lumi- 
nous old Stax with typically M ~ 1 Mq and L 10^ Lq and low effective 
temperatures around Teff — 3000 K get enriched by heavier elements, mostly 
carbon and oxygen produced through nuclear burning in the stellar interior 
and mixed upwards by convective motions (see, e.g., Iben & Renzini 1983 
or Lattanzio 1995 for reviews on the stellar evolution aspects). In the atmo- 
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spheric layers dust particles can condense out of the gas and these freshly 
formed particles provide an important source of opacity, the radiation pres- 
sure acts on them and by collisions with the dust particles a massive but 
slow outflow is driven. Since a large fraction of these old red giants are pul- 
sationally unstable, they have therefore large-amplitude waves propagating 
through the atmosphere. The adiabatic expansion in the post-shock regions 
leads to a temperature reduction and the material lifted up by the pulsations 
is dense enough to form a large number of small solid particles which can 
than grow further in the expanding and accelerating flow. Hence, the set of 
RHD equations has to be supplemented by the equations of time-dependent 
dust formation (Gail & Sedlmayr 1988, Gauger et al. 1990). A discrete ver- 
sion based on the same finite volume discretization as presented in Sect. 3.13 
can be found in Dorfi and Hofner (1991). 

The equations of RHD including the time-dependent dust formation give a 
set of M — 13 nonlinear coupled equations and the solution strategy outlined 
in the previous sections can be used to calculate the complex interaction 
between dust, radiation and matter. The solutions illustrating the behaviour 
of the adaptive grid with N = 500 grid points are dominated by multiple 
shock waves propagating through the circumstellar envelope. The radiation 
pressure on the dust grain provides the driving force to accelerate the flow. 
The stellar pulsation is simulated at the inner boundary by a moving piston, 

RnH) = /?Ar(0) + Z\[/^sin(^) , 

= 47rRjs;{t) Hisj , ( 6 . 10 ) 

where the pulsation period P and the velocity amplitude AU are free param- 
eters. The luminosity Ljv at the inner boundary undergoes the corresponding 
sinusoidal variation to account for the periodic modulation of the luminosity 
caused by a K-mechanism. The initial models can be obtained from a dust- 
free stellar atmosphere and as the pulsation starts the gas expands and cools. 
Dust particles can condense which leads to a further expansion. The outer 
boundary is taken to move according to the Lagrange conditions (3.31) and 
the whole structure grows in time until the outermost layers reach the escape 
velocity. Now the outer boundary can be replaced by a simple outflow condi- 
tion, i.e., du/dr ~ 0 which can easily be implemented (see, e.g., Eq. (3.29)). 
The boundary conditions of the radiation field are given by Eqs. (3.59) and 
(3.60). 

The three time levels of a dust driven wind plotted in Fig. 6.13 axe 
based on the following set of parameters, M* = IMq, L* = 10"^ Lq and 
Teff = 2600 K with a carbon abundance of ed^o = 1.8, a piston velocity of 
AU — 2 km s“^ and a period of P = 650 days. The inner boundary is situated 
at Rn = 0.91 R* with The density structure (Fig. 6.13a) 

exhibits the extended outflow plotted up to 20 stellar radii R# = 445 R©- 
The velocity (Fig. 6.13c) shows the shock waves propagating outwards and 
the typical outflow velocities are around 25 km s”L Note that within two stel- 
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Fig. 6.13. A typical flow pattern with a dust-driven wind (see text for more details) 



lar radii the material is partly falling towards the star and the shock waves 
have to travel against this incoming flow. The gas temperature of Fig. 6.13c 
reveals the radiating shock waves by the temperature increases followed by 
a very thin relaxation layer. Since a number of molecules are excited by 
these shock waves the peak temperatures can be measured and the adaptive 
grid provides the necessary resolution to calculate this non-adiabatic shock 
transitions. The innermost shock waves are located at large optical depths 
and the absorption due to dust particles can block the radiation field. The 
radiation field heats the dust particles which are emitting radiation them- 
selves also back into the extended atmosphere. This so-called backwarming 
can supress the formation of new particles, the expansion is slowed down, 
the gas becomes denser again which increases again the production rate of 
dust particles. Due to this time-dependence a number of these dusty outflows 
are unstable with respect to the so-called dust-induced /c-mechanism (Fleis- 
cher et al. 1995, Hofner et al. 1995). The lowest panel (Fig. 6.13d) plots the 
degree of condensation /cond of the carbon-rich dust indicating that a large 
fraction of the carbon available for the formation of dust particles has been 
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transformed into solid grains, /cond = 1 means that all available carbon is 
condensed into dust particles. 

6.5 Radiative Transfer 

The system of RHD equations can be closed by calculating the Eddington 
factor from the time-independent radiative transfer equation (Sect. 3.16). In 
Fig. 6.14a the behaviour of the Eddington factor as well as the temperatures 
(Fig. 6.14b) are plotted in more detail where the solid and dashed lines are 
depicting two different time levels. For the earlier model the dotted line shows 
the radiative temperature Trad shifted by 1500 K since it runs together with 
the gas temperature Tgas except within the relaxation layers of the radiating 
shocks. From Trad the aforementioned effect of backwarming is clearly visible 
at the inner shock wave. 
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Fig. 6.14. Behaviour of the 
Eddington 

factor, the gas temperature 
(solid and dashed lines) and 
radiation temperature (dot- 
ted line shifted by 1500 K) in 
a dust driven wind. The solid 
and dashed curves in the top 
panel refer to two different 
time levels 



Within the dust driven winds presented in Sect. 6,4 the condensation of 
solid particles leads to an increase of the opacity and therefore to a isotropi- 
sation of the radiation field in these zones of dust particles. As demonstrated 
in Fig. 6.14a the Eddington factor varies according to these changes and the 
two time levels illustrate clearly how the dust particles produced in discrete 
shells reduce / towards the isotropic value of 1/3. Since the shells get thinner 
and the central source receds, the influence of the shells is reduced and the 
Eddington factor increases towards the asymptotic value of / = 1 at large 
distances. 

Adopting the Lucy- approximation (2.16) for an extended static atmo- 
sphere an Eddington factor /l can be estimated. For the two models shown 
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Fig. 6.15. a) The ra- 
tio of the Eddington fac- 
tor / to the Eddington 
fcictor ccilculated from the 
Lucy- approximation /l and 
b) the radiation tempera- 
ture Trad to the temperature 
Tl obtained by the Lucy- 
approximation. The same 
two time levels as in the pre- 
vious Figure are plotted 



in the previous figure the ratio of /l// is plotted in Fig. 6.15 to illustrate the 
differences between the full radiative transfer treatment and such approx- 
imations. Although the overall structure is maintained the deviations are 
pronounced in the vicinity of dust shells and in the innermost parts where 
the dust formation process is very sensitive to temperature changes. The 
corresponding radiation temperature based on the Lucy-approximation Tl 
is plotted against Trad obtained from the full set of RHD equations. The 
errors depend strongly on the particular structure of the RHD model and 
almost everywhere the radiation temperature is underestimated. However, 
this comparison is based on values of Tl and /l calculated on top of a fully 
solved RHD system and does not include the back reaction of the different 
temperature and radiation pressure structure on the stellar outflow. Hence, 
taking the differences into account already seen in this figure I expect much 
larger deviations when the wind structure is obtained without computing 
the Eddington factor. First, the temperature stratification differs in the dust 
forming regions and secondly, the growth time of dust particles in the outflow 
is determined by the radiative forces which accelerate the particles. 
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7. Discussion 

This section deals in some detail with further extensions as well as problems 
concerning the numerical method presented. 

7.1 Internal Accuracy 

The internal accuracy of the computations depend on various factors. First, 
the number of grid points determines the spatial resolution possible and hence 
the local discretization errors which are unavoidable in numerical compu- 
tations. The CPU-time required scales linearly with the radial zones and 
therefore I recommend larger number of radial zones than usually taken for 
one-dimensional calculations. Most applications presented in the previous 
sections use at least 300 grid points. To check the results it can be quite 
useful to repeat some test cases with different numbers of radial points. Sec- 
ond, the way how the differential operators are represented on the discrete 
cells, in particular how well material can be transported through the numeri- 
cal grid. These so-called advection terms have been discussed in detail within 
the mathematical part of the lecture notes by LeVeque. However, in many sit- 
uations the advection schemes are developed for an equidistant radial spacing 
and/or plane geometries. Without taking into account such special properties 
of higher order schemes the gained accuracy can be lost by computing the 
local fluxes across the zones. Nevertheless, I want to emphasize again that the 
adaptive grid technique is not restricted to a particular discretization scheme 
as long as it is suited to handle a time- variable grid position. 

In Fig. 7.1 the long-term evolution of two RR Lyrae pulsation calculations 
are plotted versus the pulsational cycles. The different curves show compu- 
tations with different Lagrangian switch points (see Sect. 6.2), i.e., the grid 
point inside which the grid is moving according to a Lagrangian motions 
without advection terms. The full symbols correspond to a switch point at 
logTLag = 4.6, the open symbols to logT^ag = 4.9. The pulsation charac- 
teristics are almost unchanged as long as this switch point is not located 
too deep log^Lag < 4.7 to reduce the advection errors. The pulsational am- 
plitude Fig. 7.1a becomes constant at different levels also depicted by the 
Fourier ratio R 21 in Fig. 7.1c. The period is also almost constant to a level of 
10~^ as inferred from Fig. 7.1. Since the equation of internal energy (2.3) is 
solved the conservation of the total energy can be used to check the internal 
energy and Fig. 7. Id depicts its evolution. Unfortunately, the total energy 
errors increases linearly but this discretization error is unavoidable because 
the same number of grid points are used to resolve a more compact structure 
during the contracted phase and a more extended stellar structure during the 
expanded pulsational phase. Nevertheless, this error seems not to influence 
the pulsational properties as seen in the previous figures. 
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Fig. 7.1. Long term evolution of pulsational calculations 



7.2 Problems 

Associated with the use of the physical equations like the internal gas energy 
equation one has to monitor the conservation of total energy during the com- 
putational sequence and the errors have to be smaller than any other energy 
content influencing the physical problem. The better the physical quantities 
are represented on the grid the smaller the errors occur in conserving the to- 
tal energy, both in space and time. In principle it is possible to use the total 
energy equation to calculate the internal gas energy or temperature, but the 
strict enforcement of total energy conservation will load all computational 
errors onto the internal gas energy leading in some cases to negative values. 
Such situations can occur when, e.g., a very cold plasma is streaming at high 
velocities where the kinetic energy of the gas totally dominates the gas in- 
ternal energy. Hence, in such cases small errors in iterating the gas velocity 
immediately show up in the gas internal energy, possibly as negative values. 

As seen in the previous examples the artificial viscosity can be reduced 
by large orders of magnitude through an adaptive grid. Nevertheless, some 
small amount of viscosity is still left and will influence the overall properties, 
e.g., of pulsational calculations. In Fig. 7.2 the dependence of the pulsational 
properties is plotted against the artificial viscosity coefficient (3.35) in units 
of 10"*^. The value of Q 2 is reduced to a value around 10“^ which defines the 
local thickness of a shock front. Taking smaller values of the viscosity leads 
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Fig. 7.2. Pulsational properties as a function of the artificial viscosity parameter 



to a constant pulsation period (Fig. 7.2a), The dissipation of the code is 
further reduced as seen through the variation of the Fourier parameter (6.6) 
in Figs. 7.2b,c,d. The results presented in the Sect. 6.2 have been obtained 
with Q 2 = 1.5 X 10“^ because the observed RR light curves impose a limit 
on these variations. In real stars also nonlinear dissipation mechanism will 
operate to limit such pulsations although their physical nature may not be 
described adequately by such a viscosity term. 

As pointed out in the previous section it is necessary to check the develop- 
ment of conservation of total energy during the computational run. It is clear 
that this errors must be smaller than the energies involved in the physical 
processes. As already mentioned the errors result from the resolution and the 
representation of the quantities on a finite cell size. On the other hand the 
advection scheme plays a crucial role when either large velocities or rapid 
grid motions occur and higher order advection schemes might be invoked to 
limit these errors. 

7.3 Advantages and Disadvantages of the implicit formulation 

As mentioned earlier, for the iterative solution of our finite volume discretiza- 
tion a well-behaved convergence is necessary for which the corresponding Ja- 
cobi matrix needs to be determined. The matrix can be calculated either 
analytically or numerically but for most astrophysical applications with large 
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variations in the physical variables an analytic evaluation yields faster con- 
vergence rates. Hence, any small change of the discretization scheme enters 
the Jacobi-matrix by numerous changes in the derivatives which makes the 
testing and debugging of such an implicit code rather lengthy. Due to the 
large matrices which have to be inverted at every time step, this method has 
been applied only to one-dimensional problems but applications to higher 
dimensions are discussed in Sect. 7.5. In any case the derivatives of opacity 
and the equation of state are needed. 

The generation of suitable initial conditions requires some work as out- 
lined in Sect. 4.10. First, a full solution of the RHD system is necessary and 
second, the grid points must be moved to their positions determined implic- 
itly by the grid equation. 

According to the implicit formulation it is possible to calculate stationary 
as well as dynamic situations with one and the same code allowing, e.g. , a di- 
rect comparison with solutions obtained by integrating the time-independent 
ordinary differential equations. We can study the convergence towards sta- 
tionary solutions and the adaptive grid provides a high resolution at locations 
of steep gradients. 

7.4 Nuclear and Chemical Networks and Convection 

The method presented is not restricted to a particular set of RHD equations. 
In Sect. 6.1 the evolution of a SNR has been followed by the equations of 
radiation hydrodynamics including an equation for the cosmic ray particle 
pressure and radiative losses for the thermal plasma. The dust driven winds 
(Sect. 6.4 adopt the full set of RHD equations together with 5 equations 
describing the time-dependent evolution of dust particles. Hence, it is clearly 
possible to extend the method to multi-fluid versions which include nuclear or 
chemical networks. In principle, the implicit nature will allow such a change 
but as discussed in Sect. 5. the CPU-time scales with the number of equations 
to the third power. However, as briefly discussed in the next section a so- 
called hybrid version of an implicit code has been developed by Tscharnuter 
(1987) where due to a Legendre expansion at every grid point a large number 
of unknowns M 50 have to be determined. Such computations are still 
possible on present-day’s workstations. 

Another extension of the method involves a better coupling between gas 
and radiation by introducing the frequency dependence of the opacity and 
the radiative transfer. Hence, instead of having only two gray radiative mo- 
ments the number is increased by the amount of frequency points used in the 
computations which clearly exceeds the CPU capacity when using “realistic” 
numbers (M 1000) of frequency points. However, since the radiation prop- 
erties are usually varying on different time scales it may not be necessary to 
solve all variables with the same implicit scheme. 

In a number of astrophysical objects energy transport by convective mo- 
tions plays an important role. In the case of hydrostatic equilibrium the 
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Schwarzschild criterion (1906) limits the maximum of the temperature gradi- 
ent for stability against such convective motions. The lack of a full theory of 
convection makes it in particular difficult to construct models including this 
additional energy transport and approximations at various levels of complex- 
ity to describe convective motions are available in the literature and critically 
reviewed, e.g., by Baker (1987). A particular model, based on averaging fluc- 
tuating physical quantities over spheres by Kuhfufi (1986) has been reformu- 
lated for adaptive grids and implemented in RHD computations by Wuchterl 
(1995). The work of Gehmeyr and Winkler (1992a, b) deals with another con- 
vective scheme on adaptive grids which has been used by Gehmeyr (1993) to 
calculate convective RR Lyrae pulsations. 

7.5 Multidimensional Versions 

Due to the large Jacobi matrices involved in any implicit method, up to 
now no 2D fully implicit codes have been developed. The numbering of grid 
points in higher dimensions yields a large number of non-zero, non-diagonal 
elements in the Jacobi matrix which makes the inversion rather time con- 
suming. Nevertheless, a so-called hybrid version in axial symmetry has been 
developed by Tscharnuter (1987) which utilizes an expansion into Legendre 
polynomials to calculate axial symmetric self-gravitating RHD flows. Instead 
of having the physical variables on grid points the appropriate Legendre co- 
efficients are advanced in time increasing the unknowns per grid point. Due 
to such an expansion the discretization is done only in radial direction which 
leads to a diagonal block structure of the Jacobi matrix. The inversion proce- 
dure outlined in Sect. 5.4 can directly be used to solve the system of discrete 
equations. 

A different approach has been used by Balluch (1995) who generalized the 
adaptive grid to two dimensions in order to minimize the advection errors. 
He solves the hydrodynamical equations explicitly on the moving coordinates 
but advances the grid according to an implicit equation. The basic idea of 
this coordinate transformation method is to perform the coordinate trans- 
formation before discretization and to construct a grid on which the physical 
variables are more smoothly distributed than in physical space. This trans- 
formation leads to generally curvilinear meshes with a rather complicated 
metric. Again, we refer to Balluch (1995) for a detailed discussion. 

7.6 Improvements and Further Recommendations 

This section deals with some new insights on the discretization on adaptive 
grids. Up to now and due to complex programming these new developments 
are not included in any adaptive RHD-code. 

We can use the adaptive transport theorem (Eq. (2.24)) (G. Wuchterl, 
private communication) to define the grid velocity (3.7) in an alternative 
way by setting / = 1, i.e., 
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or in a discrete version in our symbolic notation, 



(71) 



y = Z\(47rr2uerid) . (7 2) 

Rearranging the terms and specifying the above formula at each grid point 
we end up at a radius r with 
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which is a more geometrical definition than the original one of Eq. (3.7). First, 
we see that in plane symmetry we get the old definition of Secondly, 
from <5r = r — r ° we calculate 



u 





where for large radii r oo also the old definition is revealed. According 
to the derivation from the adaptive transport theorem this velocity is now 
also based on a finite volume formulation. 

As seen in the finite difference formulations of Sect. 3.7 the variables de- 
fined within a computational cell are located in the center of this cell. As 
demonstrated by Monchmeier and Muller (1989) in the context of explicit 
methods the centering of variables on a non-equidistant, non-planar grid be- 
comes of particular importance in multidimensional computations. Hence, 
future versions of RHD codes should discretize all geometrical terms accord- 
ing to their description. Clearly, such an explicit dependence of the radial 
centering increases the derivatives to be calculated for the Jacobi matrix. 
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1. Introduction 

The Oxford English dictionary defines simulation as “an attempt to deceive” . 
It is one of the aims of this lecture to demonstrate that this definition is 
not appropriate in astrophysics. Instead, I hope to show that simulation is 
an indispensable and expedient tool for understanding many astrophysical 
phenomena. In particular, I try to illustrate that simulation is sometimes 
crucial in providing the link between a multitude of bewildering observations 
and a variety of theoretical ideas and models existing for an astrophysical 
phenomenon. 

Another aim of my lecture is to discuss the techniques, the results, and the 
implications of simulations of astrophysical fluid flow. Obviously, a complete 
coverage of this vast and rapidly growing field of research is far beyond the 
scope of this lecture. Thus, I have restricted myself to the discussion of a 
personally biased set of a few specific astrophysical flow problems, the main 
bias being my past and/or present personal involvement into the simulation 
of these flow problems. 

In Sect. 2, 1 will discuss why and how simulations are done and what com- 
putational resources they require. Then I briefly address some basic numerical 
issues, like what are Eulerian and Lagrangian schemes, what is an explicit 
and implicit method, and how are accuracy and efficiency of a method related 
to each other. Concerning the finite volume methods used for integrating the 
hydrodynamic equations the reader is referred to the lecture by LeVeque, 

Section 3 deals with the simulation of core collapse supernovae. After a 
discussion of the observational facts, the physics of spherical core collapse is 
reviewed. Then I discuss observations of supernova SN 1987A, which forced 
supernova model builders to seriously consider nonspherical models. Trig- 
gered by these observations an ever growing set of simulations has been per- 
formed, which convincingly showed that various kinds of generic instabilities 
occur in core collapse supernovae giving rise to mixing and non-radial mass 
motion in the stellar envelope and in the star’s iron core during the actual 
supernova explosion. These simulations are reviewed in some detail, because 
they provide a very instructive example demonstrating the necessity for and 
the potential of hydrodynamic simulations. Next, I discuss simulations of ax- 
isymmetric and non-axisymmetric rotational core collapse. The section ends 
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with an overview of the gravitational radiation emitted during core collapse 
because of convective mass motion, anisotropic neutrino emission and rota- 
tion. 

In Sect. 4, I discuss the interaction of hydrodynamics and thermonuclear 
burning. Here I have not intended to give a review of the respective sim- 
ulations of astrophysical phenomena, like e.g., thermonuclear supernovae or 
novae, but I have rather tried to point out the underlying physical processes 
and technical aspects, which have to be considered for these kind of simula- 
tions. After a discussion of relevant time scales and various types of nuclear 
burning, solution methods for (nuclear) reaction networks are analyzed in 
some detail. Next, I discuss various aspects, which have to be taken into 
account when coupling nuclear reaction networks with hydrodynamic codes. 
Some specific astrophysical applications of thermonuclear hydrodynamics are 
briefly mentioned, too. At the end of the section I present some instructive 
numerical experiments, which show the traps and difficulties encountered 
when simulating thermonuclear detonations. 

In Sect. 5, I review simulations of astrophysical jets the focus being on 
relativistic extragalactic jets. In order to provide the necessary background, 
I first discuss some observational facts and summarize the well-known prop- 
erties of classical jets. Next, special relativistic hydrodynamics is briefly in- 
troduced (see also Sect. 8 of the lecture by LeVeque). The remainder of the 
section deals with the morphology and dynamics of relativistic jets includ- 
ing two short subsections on two recent developments, namely the long term 
evolution of relativistic jets and the simulation of parsec-scale jets. 

Finally, in Sect. 6, the method of Smoothed Particle Hydrodynamics 
(SPH) is reviewed. SPH is a free-Lagrange method solving the equations of 
hydrodynamics without a computational grid. This latter property and the 
simplicity of the basic SPH algorithm are the two main reasons, why SPH 
is so popular, especially among astrophysicist. Besides presenting the basic 
SPH algorithm, I also review extensions of SPH, which allow one to include 
self-gravity and to use a variable smoothing length for improving the spatial 
resolution. After addressing some computational aspects of SPH, I conclude 
with a discussion of two test calculations, which demonstrate the capabilities 
and limits of SPH, 



2. Simulations: A Link Between Observation and Theory 

In astrophysics one is in the extraordinary situation, that the objects of in- 
terest (e.g., stars or galaxies) axe not accessible to any kind of manipulation. 
No experiments can be performed, and observations of a particular one-time 
event (e.g., of a specific supernova) cannot be repeated. Hence, astrophysicists 
have to rely solely on the information they can receive from astrophysical phe- 
nomena via electromagnetic radiation, particle radiation, like e.g., neutrinos 
or cosmic rays, and gravitational radiation. 
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The situation is further complicated by three facts. Firstly, the physi- 
cal processes giving rise to the astrophysical phenomenon may occur deep 
inside the observed object, i.e.,we only have indirect evidence of these pro- 
cesses. This is the case, for example, in a core collapse supernova explosion 
where the explosion of the star is triggered by the gravitational collapse of 
its iron core in a fraction of a second while the observable (electromagnetic) 
supernova explosion only begins hours later (see Sect. 3). Secondly, the pro- 
cesses may involve extreme conditions which are experimentally inaccessible 
in the laboratory. This implies that one has to rely on extrapolation of known 
physics into regimes of density, temperature, magnetic field strengths, gravity, 
etc., which are many orders of magnitude beyond our well-tested laboratory 
knowledge. Thirdly, the processes one tries to understand often occur on time 
scales long compared to the human life span, i.e.,one is stuck with a snap- 
shot of the phenomena. Such a case we will encounter in the discussion of 
extragalactic jets, which are collimated flow structures emanating from active 
galactic nuclei and extending up to several hundred thousands of light years 
into intergalactic space (see Sect. 5). The inferred time scale for the formation 
of these jets is several million years. 

The snapshot restriction is partially alleviated, because observations show 
similar objects (e.g., stars or galaxies) in a large variety of states suggesting 
these states may be connected or even form an evolutionary sequence. Hence, 
snapshots of different objects belonging to the same class provide information 
on the evolution of this class of objects. However, one first has to prove 
that the evolutionary hypothesis is indeed correct and that a specific object 
belongs to the considered class. 

How does one proceed under the circumstances just described? Well, one 
constructs a theoretical model which incorporates all the physical processes 
thought to be of importance for the astrophysical phenomenon. Such a the- 
oretical model in almost all cases involves some degrees of freedom which 
manifest themselves as parameters of the model. The model, if of any value at 
all, should make some definite predictions about the properties or behaviour 
of the object or phenomenon. In order to compute these predictions one (ex- 
cept for very simple models) requires simulation, because elaborated models 
consist of a set of nonlinear structure equations or evolutionary equations. 
Comparing the model predictions with observations allows one to constrain 
the parameters of the model or to verify or falsify certain model assumptions. 
In the latter case, one can try to improve or modify the model to achieve 
a better agreement with the observation, and start the procedure all over 
again. Thus, simulation provides the link between theory and observation in 
the iterative process to understand observed astrophysical phenomena. This 
is especially true for processes which are only indirectly observable, because 
they occur either inside the object or over time scales long compared to the 
human life span. In both cases simulation allows one to make phenomena 
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“visible”, which cannot be seen otherwise, and to determine from a variety 
of possible physical processes those which explain the observations. 



2.1 Procedure and Resources 



2.1.1 Hydrodynamic Approximation. The matter of astrophysical ob- 
jects can often be approximated as a gas or fluid. Two requirements must be 
fulfilled in order for this continuum approximation to hold. Firstly, one has 
to assume that the microscopic behaviour of single particles can be neglected. 
This requires that 



A < L , 



(2.1) 



where A is the collisional mean free path of the particles and L is a char- 
acteristic macroscopic linear dimension of the system, or a scale over which 
the distribution function varies significantly. If Eq. (2.1) holds, the concept of 
a fluid element can be introduced, the linear size of the fluid element being 
small compared to L but large compared to A. This implies that the number 
of particles in the fluid element is large, and hence mean physical quantities, 
like e.g-., , the fluid density q or the fluid velocity v can be defined for the fluid 
element. The velocity of the individual particles is then given by n — i; -h tn, 
i.e.jit possesses a random component w above the mean. Because A <C T, 
however, the particles only perform a random walk about the mean motion u, 
and consequently the fluid element remains well defined during the evolution 
except for some small loss of particles near its surface, which, if necessary, 
can be well described by an additional diffusion term in the hydrodynamic 
equations. 

Secondly, the forces between particles must be saturating or of “short 
range” , because otherwise collective effects must be taken into account. For- 
mally this can be expressed as 



lim 

/V— >-oo 




const. , 



( 2 . 2 ) 



where E/N is the energy per particle. For a fluid element with volume V 
one can then define an energy density e = EjV ~ n{EIN) where n = N/V 
is the number density, and a pressure p = n{d€/dn) — e exerted on the 
“walls” confining the fluid element. Examples of non-saturating forces are 
gravity and electromagnetic forces which both scale as (~ r~^). In the case 
of gravity the energy per particle E/N ^ for bosons and E/N ~ for 
fermions, respectively. Consequently, gravity must be included in the hydro- 
dynamic equations as a macroscopic external force. Although electromagnetic 
forces are long range too, they saturate in electrically neutral systems due to 
screening. 

If the hydrodynamic approximation holds, in the simplest case, the evo- 
lution of the matter in the astrophysical object is governed by a set of con- 
servation laws for mass, momentum and energy (see Sect. 2 of the lecture by 
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Le Veque for the actual formulation of the hydrodynamic equations). In the 
general case, additional equations and/or additional source terms have to be 
considered, which e.g., describe the effects of viscosity, nuclear burning, the 
diffusive transport of matter or radiation, the evolution of magnetic fields 
and their coupling to matter (magnetohydrodynamics), the evolution of ra- 
diation fields and their interaction with matter (radiation hydrodynamics; 
see lectures by Dorfi and Mihalas), and self-gravity. Moreover, in certain ap- 
plications the conservation equations have to be formulated consistently with 
special or general relativity (see Sect. 5, and Sect. 8 of the lecture of Le Veque 
and Sects. 5 and 6 of Mihalas). 

In astrophysical applications one usually has to solve the hyperbolic set 
of the inviscous and compressible Euler equations instead of the parabolic 
set of the viscous and compressible Navier-Stokes equations, because (except 
in shock waves) viscosity and heat conduction due to physical processes are 
often negligibly small. However, because of truncation errors and because of 
finite spatial resolution, all numerical methods proposed for the solution of 
the Euler equations posses some finite intrinsic numerical viscosity, whose 
form and size depends on the numerical method, on the grid resolution, and 
to some extent of the flow problem itself. The effect of the numerical vis- 
cosity on the solution can only be quantified by resolution studies, i.e.,by 
performing a series of simulations of the same flow problem using different 
grid resolution. Thus, when simulating astrophysical flow one is in the strange 
situation that one tries to solve the inviscous Euler equations, but instead 
solves some viscous variant, which is also different from the Navier-Stokes 
equations. 

2.1.2 Discretization. The evolution equations are a set of coupled nonlin- 
ear first-order (in time) partial differential equations. One way of solving this 
set of equations is to discretize them in time and space. By this discretization 
the partial differential equations are transformed into a set of coupled nonlin- 
ear algebraic equations, which can be solved on a computer with appropriate 
numerical techniques. As discretization introduces unavoidable errors, it is 
crucial to use discretization schemes, which minimize the errors. This is at 
the heart of the “art of computing” . 

Guided by the conservation properties underlying the hydrodynamic 
equations applied mathematicians and physicists have been able to develop 
accurate and stable high-resolution finite volume schemes (see Sect. 3 of the 
lecture by Le Veque) . In finite volume schemes the (finite) computational do- 
main is discretized into a finite number of zones or cells (typically 10^ to 10^ 
per spatial dimension). The hydrodynamic scalar fields (e.g., density) and the 
components of the vector fields (e.g., momentum) are approximated by a set 
of discrete values, which are the (approximate) cell averages of the respec- 
tive variable. Time is discretized too, the evolution being approximated by a 
series of finite time steps (typically 10^ to 10^ per simulation). 
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Besides finite volume schemes, another completely different method is 
widely used in astrophysics for integrating the hydrodynamic equations. This 
method is Smoothed Particle Hydrodynamics, or SPH for short. In SPH the 
fluid or gas is described by a finite set of “particles” (typically 10^ to 10^), 
which move with the flow (see Sect. 6 for more details). 

2.1.3 Required Computer Resources. In order to derive an estimate of 
the computer resources needed for a hydrodynamic simulation let us first 
estimate the number of floating point operations (flops). Per computational 
zone one has to evolve at minimum three variables (or equivalently three 
equations) in a one-dimensional simulation (density, one component of the 
momentum vector, energy) and at maximum about 10 to 20 variables in a 
three-dimensional simulation involving a small reaction network with a dozen 
or so nuclear species. The computational grid will have from 10^ (in ID) up 
to 10^ (in 3D) zones. The number of time steps to be performed ranges from 
about 10^ for simple dynamic flows to more than 10^ in cases where part of 
the flow is subsonic, or where some other physical process (like, e.g., neutrino 
transport) limits the size of the time step to a value much smaller than the 
dynamic time scale. Finally, per zone, per variable and per time step the 
modern algorithms (incorporating Riemann solvers) require about 10^ to 10^ 
flops. Significantly more floating point operations will be necessary, when one 
uses a complicated equation of state and/or has to evaluate reaction rates 
and transport coefficients. Multiplying all these factors one finds that one 
needs to perform of the order of 10^® (in ID) to 10^® (in 3D) floating point 
operations per simulation. 

Presently, a cheap desktop workstation is able to perform about 30 Mflops 
per second and a state-of-the-art parallel computer (e.g.,a CRAYT3E with 
512 processor elements) about SOGflops per second when simulating hydro- 
dynamic flows. Consequently, a simple ID simulation requires only a couple 
of minutes on a desktop workstation, while large 3D simulations will need of 
the order of several 100 hrs on the fastest computers available today. 

From the above numbers one can also estimate that a large 3D simulation 
requires computers with up to 10 GBytes of main memory, and will produce 
output, which can exceed 1 GByte per model or 100 GBytes per simulation. 
This shows that data handling and data analysis of large 3D hydrodynamic 
simulations is a nontrivial problem. Prom my personal experience I can say 
that the analysis of a simulation typically requires about twice to three times 
the effort than does the preparation and performance of the simulation itself. 
In this respect, as well as in many other aspects, large-scale computing is 
quite similar to performing large experiments. 

2.2 Some Basic Issues 

2.2.1 Lagrangian and Eulerian Formulation. The hydrodynamic equa- 
tions can be formulated with respect to two distinct classes of coordinate sys- 
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terns called Lagrangian and Eulerian coordinates, respectively (e.g., Potter 
1973, Chap.IX/1). 

Eulerian or spatially fixed coordinates, i.e., “purely” geometric coordi- 
nates, specify time-independent points in space. At each point the motion of 
the fluid is given by v{xi,X 2 ,X 3 ^t). In Lagrangian formulation the coordi- 
nates (xi,X 2 ,X 3 ) are the components of the radius vector r of a fixed fluid 
element, i.e., the coordinates are a function of time and of three parameters 
which identify the element. 

The Lagrangian formulation guarantees that no numerical, i.e., unphysical 
diffusion of momentum, heat or composition occurs during a simulation, be- 
cause the nonlinear advection terms responsible for the occurrence of numer- 
ical diffusion are not present in the Lagrangian formulation of the hydro- 
dynamic equations. Numerical diffusion causes especially difficult problems 
when trying to model (thermonuclear) burning fronts (see Sect. 4). The ad- 
vantage of the Lagrangian formulation breaks down, however, and severe nu- 
merical difficulties arise when multi-dimensional problems are to be treated. 
The comoving grid in general will become very distorted leading e.g., to grid 
tangling in the case of shear or vortex flow. Then one is forced to rezone 
the grid. Even when the rezoning is done carefully, which is not as easy and 
straightforward as one might think on first glance (note e.g., that the con- 
servation of mass, linear and angular momentum, and of energy should not 
be violated by the rezoning algorithm), a significant amount of numerical 
diffusion of the various quantities is introduced. Thus the major advantage 
of the Lagrangian approach is lost. 

For multi-dimensional problems, therefore, Eulerian hydro-codes using 
time-independent spatial coordinates are to be preferred, since the grid re- 
mains regular. Obviously then special efforts have to be used to minimize 
the inevitable numerical diffusion. This can be achieved using more accurate, 
higher-order schemes (see Sect. 3 of the lecture by LeVeque). 

Within the last twenty years, a new type of Lagrangian hydrodynamics 
method has been developed, the so-called Smoothed Particle Hydrodynamics 
or SPH method (Lucy 1977; Gingold k Monaghan 1977; Monaghan 1982). 
SPH is a free-Lagrange method, in which spatial gradients are evaluated 
without the use of a grid. Thus the method does not suffer from the problems 
caused by grid tangling and subsequent rezoning. The SPH method will be 
discussed in Sect. 6. 

2.2.2 Explicit and Implicit Methods. Following Potter (1973, Chaper 
II/4) let us consider a system defined by the state vector u(r, t), in the space 
domain R = R{r). If u = is defined at time ^ = 0, and if u is defined on 
the surface S of R for all time t, we wish to determine u for all time t in R. 
The state of the system may be obtadned for all time t as solutions to the 
initial value equation 

du 
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In general, L is a nonlinear operator which is algebraic for ordinary differential 
equations, and is a spatial differential operator for Eq. (2.3) being a partial 
differential equation. 

Neglecting higher than second-order terms the most general discretization 
of Eq. (2.3) with respect to time is given by (e.g., Potter 1973) 

-h Lu^{l - e)At -h Lu^'^^eAt , (2.4) 

where and are the state vectors of the system at adjacent time points 
and At. Here e is an interpolation parameter, 0 < e < 1, and 

second-order accuracy is only maintained when e ~ 1/2. In the special case 
when £ = 0, the new state is defined explicitly by the known state 
at the previous time step. In this event the method is called explicit^ while 
otherwise if £ 7^ 0 the method is called implicit 

Explicit schemes are only stable if the size of the time step is restricted by 
the famous Courant-Friedrichs-Lewy condition (see Sect. 3.1.10 of the lecture 
by Le Veque), which says that no information is allowed to travel more than 
one zone within a single time step. For the special case of a two-dimensional 
Cartesian grid the CEL condition is given by 



At < AtcFL 




The minimum in Eq. (2.5) is computed with respect to all zones (i,j), and 
Axi^ Ayj^Ui,Vj and Cij are the grid spacing and the flow velocities in x- and 
y-direction, and the local sound speed, respectively. 

In general implicit schemes allow for larger time steps but at the ex- 
pense of solving a system of nonlinear algebraic equations. This is achieved 
by linearizing the system and iterating the solution (e.g.,with the Newton- 
Raphson method). Thus implicit schemes require in each time step several 
(typically 3 to 5) times the solution of a linear system, i.e., several matrix in- 
versions. Obviously implicit schemes become prohibitively time and storage 
consuming when directly used for two or even three-dimensional problems, 
because the order of the matrix to be inverted is given by the product of 
the number of variables times the number of zones in each spatial direction, 
i.e.,7VV^*iVl.Ar2- A^3. 

Although operator splitting techniques can help to significantly reduce 
this problem (see Sect. 4 of the lecture by Le Veque) implicit schemes have 
not been widely used in astrophysics because of two further difficulties. Firstly 
the evaluation of the Jacobian required for the iterative solution of the non- 
linear system is a problematic task since some variables may only be given in 
tabular form (equation of state, opacities etc.). In addition, experience shows 
that errors made in the evaluation and in the programming of the numerous 
derivatives are difficult to locate, i.e., their removal may take a significant 
amount of the code development time. Secondly, when features are present 
in the flow across which one or several variables vary sharply and which 
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are neither stationary in the Lagrangian nor in the Eulerian reference frame 
(e.g., shocks and in particular accretion shocks) the advantage of the implicit 
approach can be greatly reduced. As convergence is only obtained if in all 
zones none of the iterated variables varies by more than about 10 to 30% 
from time step to time step, sharp non-stationaxy features in the flow will 
lead to a more or less severe time step restriction. This restriction can only be 
overcome with an adaptive grid, which however poses other sometimes even 
more challenging problems concerning the stability of the numerical scheme 
and the stiffness of the algebraic system (see lecture by Dorfi). 

2.2.3 Accuracy and Efficiency. Because multi-dimensional calculations 
place such a demand on resources and time, one should expend considerable 
effort in finding the best procedure. Accuracy - determined by testing versus 
known solutions, and efficiency - getting maximum performance from existing 
hardware, should be the main concerns. 

The computational expense of a given calculation is proportional to the 
number of zones calculated and the number of time steps taken. Thus, in 3D, 

Load ^ > (2.6) 

where I is a characteristic length. Ax is the (equidistant) zone size, r is the 
time the calculation is to be run, and St is the average size of a time step. For 
an explicit method, the size of the time step is limited by the CFL condition 
St < Ax/cs (see previous subsection), so that 

Load T Cs / (Ax)^ . (2.7) 

If one method (A) can produce equally accurate results, but uses n times the 
number of zones needed by another method (B), then the ratio of expense is 

Load(A) /Load(B) = . (2.8) 

The memory requirement scales as n^; in practice this is often the inflexible 
limit. One may argue for more computer time but the memory size is more 
difficult to change. 

Estimating n for different pairs of codes is not trivial, but depends upon 
the problem to some extent. An important problem of interest is the preser- 
vation of temperature or heat differences (when modeling burning fronts) and 
of compositional differences (when modeling mixing processes). A useful test 
problem is one which propagates a compositional step-function through the 
mesh, with no pressure gradients (alternatively one may use compensating 
steps in density and temperature) . The result should simply be a translation 
of the step to new positions. Numerical errors will tend to broaden the step 
into a “ramp” which spreads over several zones. Plotting the width, in zones, 
of the ramp as a function of the number of zones propagated through is a 
useful way to compare quality of numerical methods. 

Such a comparison is shown in Fig. 2.1, which is taken from Fryxell, 
Muller & Arnett (1989), who have tested the performance of a number of 
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Fig. 2.1. Comparison of several numerical methods for the translational motion of a 
contact discontinuity. The various curves give the width of the contact discontinuity 
(in grid zones) as a function of the number of zones it has been propagated through 
the grid. (Prom Pryxell et al. 1989) 



popular methods of multi-dimensional numerical hydrodynamics on the step 
test problem. These methods included Lax-WendrofF, Donor Cell or first- 
order upwind, Godunov’s method, a flux-vector splitting method (SADIE; 
Arnold 1985), and the piecewise parabolic method (PPM) of Colella & Wood- 
ward (1984). Most of these methods are discussed in Sect. 3 of the lecture by 
Le Veque. 

Fryxell, Muller & Arnett (1989) considered an equidistant one-dimensional 
grid, where a contact discontinuity has been propagated through 700 zones. 
Fig. 2.1 then shows that a version of PPM, without special software for de- 
tection of such contact discontinuities, has a ramp width of about 6 zones. 
Lax-Wendroff gives a ramp about 20 zones wide. PPM, with contact detec- 
tion, gives a ramp only two zones wide. SADIE, which does shocks very well, 
is as bad as Lax-Wendroff for this test. For both these methods, n 2 ^ 10. First 
order methods, such as donor cell and Godunov, are much worse (n 40 and 
n 2 :^ 30). 

The computational load for a given accuracy of SADIE or Lax-Wendroff 
(n 10, which is second best after PPM on the step test problem) relative to 
PPM (with contact detection switched on) is 10^ (in 2D) and 10^ (in 3D). This 
assumes that each grid point requires the same effort, which is not the case, 
but this is not a major error for problems which have significant microphysics. 
Note that the ratio of speed for a large parallel computer ( 2 ::^ 100 Gflops) and 
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a fast workstation lOOMflops) is approximately 10^, which is comparable 
to the difference in 3D performance for a resolution ratio of n ^ 6. 



3. Simulations of Core Collapse Supernovae 

Supernovae (SNe) are one of the most exciting astrophysical phenomena, in 
particular if they occur in our own galaxy as the historical supernovae did 
or if they explode in our immediate galactic neighborhood as the spectacular 
SN 1987A in the Large Magellanic Cloud (LMC). Besides being observation- 
ally exceptional events supernovae also pose one of the most challenging theo- 
retical problems in astrophysics involving multi-dimensional hydrodynamics, 
plasma physics, weak interactions, neutrino transport, nuclear physics, gen- 
eral relativity, thermonuclear burning fronts and turbulent combustion. 

Supernova explosions are either powered by the gravitational binding en- 
ergy released during the collapse of a stellar core to a neutron star (Baade 
& Zwicky 1934) or a black hole, or by the energy released during explosive 
burning of degenerate thermonuclear fuel (Hoyle & Fowler 1960). 

After briefly discussing observational facts the (prompt and delayed) ex- 
plosion mechanism of core collapse supernovae is reviewed. Subsequently, 
multi-dimensional simulations of core collapse supernovae are addressed in- 
cluding Rayleigh-Taylor instabilities in supernova envelopes, neutrino driven 
convection, and the effects of rotation. Finally, the gravitational wave signa- 
ture of core collapse supernovae is discussed. Obviously, many other aspects 
of core collapse supernovae had to be omitted in my lecture (e.g., progenitor 
evolution, influence of magnetic fields, high density equation of state, nucle- 
osynthesis, light curve modeling, spectral synthesis, radio signature, remnant 
evolution, SN 1987A), and some of the addressed topics are discussed only 
relatively briefly. For additional information, the reader is kindly referred to 
other reviews on the subject (e.g., Weiler & Sramek 1988; Arnett et aJ. 1989a; 
Hillebrandt & Hoflich 1989; Arnett 1996; and articles in Petschek 1990; Ray 
& Velusamy 1991; Bludman et ai. 1994, 95; McCray & Wang 1996; Ruiz- 
Lapuente et ai. 1997), 

3.1 Supernova Observations 

3.1.1 Classification, Light Curves and Spectra. Observationally, su- 
pernovae are divided into two distinct major spectral classes, which are dis- 
tinguished by the absence (Type I) or presence (Type II) of hydrogen lines 
in their spectra at any time (Minkowski 1941; Fig. 3.1). This immediately 
implies that if no spectrum is obtained for a supernova, its type cannot be 
determined. Due to a late discovery, incomplete coverage or a too small ap- 
parent brightness there is a lack of spectral data for many SNe, 

During the past two decades observations have further revealed that both 
types do not form a homogeneous set, but can be classified into distinct 
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subclasses which are called SNela, Ib, Ic, II-L and II-P, respectively (Fig. 3.1; 
for a review see, e.g.,Leibundgut 1995; Filippenko 1997). 

SNe II are divided on the basis of their light curves into the photometric 
“plateau” (SNell-P) and “linear” (SNe II-L) subclasses (Barbon et al. 1979). 
The rare class of SNIIn (e.g-.,SN 1988Z) and the possible existence of a 
separate class of faint SNe II similar to SN 1987A are not discussed here 
(e.g^., Filippenko 1991). The light curves of SNe II-L decline linearly past the 
epoch of maximum brightness, while those of SNe II-P exhibit a distinct phase 
of almost constant luminosity, the plateau, past maximum light between days 
to ^80 (Fig. 3.2). 



Maximum light spectra 



H / no H 




SN II 



Light curve shape 
max. light continuum 




II L IIP Mb 



(SN1987K, SN1993J) 



SNI 



Si / no Si 




la He / no He 

/ \ 



lb I c 

Fig. 3.1. Classification scheme of supernovae (from Harkness &: Wheeler 1990) 



Because of its distinctive plateau it seems to be safe to classify a supernova 
as type II-P on its light curve shape alone. However, disregarding the absolute 
luminosities the similarity of the light curve shapes of SNe I and SNe II-L 
(Fig. 3.2) implies that it may be dangerous to distinguish between these two 
classes on the basis of their light curves, in particular if one keeps the scatter 
of the individual supernova light curves around the mean light curve in mind. 
In case of incomplete light curve coverage a classification without spectral 
information will be almost certainly misleading. 

Compared to SNe II the light curve shapes of SNe I are broadly similar 
(Doggett & Branch 1985) except that the light curves of SNelb/c seem 
to be somewhat broader than that of SNela on average (Fig. 3.2). All SNe I 
light curves show a pronounced smooth maximum followed by an exponential 
decay, which is observed for SNell a late epochs, too (Fig. 3.2). SNela are 
intrinsically the brightest of all supernova types (Fig. 3.2). SNelb/c are about 
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Fig. 3.2. B template light curves for Type la, Ib/Ic, II-P and II-L supernovae (from 
Cappellaro et al. 1997) 

factor of 4 dimmer than SNela (Uomoto & Kirshner 1985) and seem to be 
on average slightly brighter than SNell (Leibundgut 1996), 

The three subclasses of the hydrogen-deficient (Type I) SNe are defined 
according to features in their optical spectra within about one month past 
maximum brightness (Figs. 3.1 and 3.3). 

Contrary to Type la supernovae (e.g., SN 1981B in Fig. 3.3), SNelb/c 
(e.g.,SN 1984L in Fig. 3.3) do not show the Sill A 6355 A blue-shifted ab- 
sorption feature at A 6150 A (Bertola 1964; Wheeler & Levreault 1985), and 
their infrared spectra do not exhibit the absorption feature at ~ 20 days, 
which characterizes the J, H and K light curves of SNela (Elias et aJ. 1985). 
In addition, at early times the spectra of SNelb contain strong optical He I 
lines, most prominently a distinctive absorption dip at ^ X 5700 A resulting 
from a Doppler-shifted He I line at A 5876 A, which are not present in the 
spectra of SNela at those epochs (Wheeler & Harkness 1990). 

SNe Ic lack obvious lines of both Si II and He I in their optical spectra. This 
subclass was proposed by Harkness & Wheeler (1990) and Wheeler & Hark- 
ness (1990) substituting the original categories “helium-rich” and “helium- 
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Fig. 3.3. Spectra of a Type la, Ib, IIP and II L supernovae (from Wheeler et 
aL 1987) 



poor” (Wheeler et aJ. 1987) for SNelb. However, whether SNeIc are really 
a distinct additional subclass requiring substantially different evolution and 
progenitors or whether they are just extremely helium deficient SNelb is 
still unclear (e.g., Leibundgut 1995; Filippenko 1997; Clocchiatti & Wheeler 
1997). Support for the latter “continuity hypothesis” comes from observa- 
tions, which show the presence of at least some helium in a subset of SNeIc, 
e.g.jSuch as SN 1987M (Clocchiatti et aL 1996) and SN 19941 (Filippenko et 
al 1995). 

Late time spectra of SNe I are very different from spectra neax maximum 
light. At late epochs, spectra of SNe la are dominated by strong blends of 
emission lines of [Fell], [Felll] and [Co III] with CaH and K in absorption 
but no evidence for O (Axelrod 1980), while those of SNelb/c show strong 
lines of 01, Mgl and Call in emission (Gaskell et ai. 1986; Axelrod 1988; 
Fransson 1988). 
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The spectra of SNe II are dominated by strong P-Cygni Balmer lines of 
hydrogen near maximum light and in the nebular phase (Zwicki 1965). The 
spectra further show [0 1] lines, which are strong in some events but only 
weak in others, and permitted and forbidden Call lines (e.g., Hillebrandt & 
Hoflich 1989). Whether SNell-L and II-P have distinctively different spectra 
and whether the observed variations in Type II spectra reflect some intrinsic 
differences in their physical properties is unclear, because the sample of well- 
studied SNe II-P is still too small (Harkness & Wheeler 1990). 

Finally, peculiar supernovae have been observed, like SN 1987K and 1993 J, 
which changed from canonically looking SNe II at early times to SNelb like 
objects with a little hydrogen at late epochs (Filippenko 1988). For this rea- 
son these supernovae are classified as SNellb (Woosley et ai. 1987; Filip- 
penko 1988). The metamorphosis of type shows that the current supernova 
classification scheme predominantly reflects differences in the pre-supernova 
evolution and the structure and composition of the progenitor, but not the 
underlying explosion mechanism (see Sect. 3.1.3). 

3.1.2 Frequency and Spatial Distribution of Supernovae. A first at- 
tempt to determine the frequency of SNe has been made by Zwicky (1938) 
more than fifty years ago. His study and various later investigations (for 
a review see, e.g., van den Bergh Tammann 1991; Tammann 1994) have 
qualitatively shown that the frequency of SNe is strongly dependent on the 
intrinsic properties of the parent galaxy, in particular on its morphological 
type and luminosity. Furthermore, SNe rates depend on the supernova type. 
Quantitative uncertainties in the determination of supernova rates arise, be- 
cause (i) a significant fraction of all SNe are discovered by chance, rather 
than during systematic search programs; (ii) the spectroscopic and photo- 
metric coverage of SNe is often incomplete even the type being unknown for 
a significant fraction of all discoveries; and (iii) inclination corrections to the 
SN discovery frequency in spirals are both large and uncertain as first pointed 
out by Tammann (1974). 

One of the latest and most comprehensive determination of the frequency 
of SNe is due to Cappellaro et aJ. (1997). Their data sample joins the logs 
of five SN searches (Asiago, Crimea, Calan-Tololo and OCA photographic 
surveys and the visual search by Evans) and consists of 110 SNe. References 
to earlier rate determinations can be found in Cappellaro et ai. (1997) and in 
the review by van den Bergh & Tammann (1991). 

The SN frequency determined by Cappellaro et ai. (1997) in different types 
of galaxies for different types of SNe is summarized in Table 1. The errors 
given in Table 1 are total errors estimated by adding quadratically the errors 
due to statistics, parameters (absolute and limited magnitudes, light curve 
shapes, etc.) and both nuclear and inclination biases. The rates are given in 
supernova frequency units or SNu, where 1 SNu = 1 supernova per 100 yr per 
10^^ Lq. Because of normalization to galaxy luminosity and because Cappel- 
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laro et al. (1997) assumed a value of the Hubble constant H of 75km/s/Mpc 
the rates scale as (H/75)^. 

In accordance with previous investigations Cappellaro et al. (1997) find 
that SNelb/c and SNell do only occur in spiral and irregular galaxies. They 
further find that the rate of SNela increases by about a factor of 2 moving 
from ellipticals (0.13 SNu) to late spirals (0.24 SNu). This confirms the work 
of Cappellaro et al 1993, but is in contradiction to the rates obtained by Tam- 
mann et ai. (1994), who determined a 4 times larger rate for the occurrence 
of SNela in ellipticals but a similar rate in spirals. According to Turatto et 
al. (1994) this disagreement is probably due to a bias of SN discoveries in the 
“fiducial sample” of galaxies used by Tammann et al. (1994). Further results 
are that the most prolific galaxies are late spirals where 2/3 of the SNe are 
SNell (0.88 SNu), and that SNelb/c are relatively rare events which amount 
to less than 40% of all SNe I (0.16 SNu in late spirals). 



Table 3.1. Supernova rates according to Cappellaro et ai. (1997) in supernova 
frequency units or SNu, where 1 SNu = 1 supernova per 100 yr per 10^^ L©. A 
Hubble constant H of 75km/s/Mpc was assumed, i.e., the rates scale as [H/lbY 



galaxy 

type 




la 


SN rate [SNu] 
Ib/c 


II 


E - 


SO 


0.15 ± 0.06 






SOa- 


Sb 


0.20 ± 0.07 


0.11 ± 0.06 


0.40 ± 0.19 


Sc - 


Sd 


0.24 ± 0.09 


0.16 ± 0.08 


0.88 ± 0.37 



From Table 1 one can estimate the expected rate of SNe in the Galaxy as- 
suming its type to be Sb and its luminosity to be 2 x 10^^ L©. In a millennium 
one expects 4 ± 1 SN la, 2 ± 1 SN Ib/c and 12 ± 6 SN II, or a total of 18 ± 8 
supernovae. Hence, the average interval between two supernova explosions 
is 40 - 100 years for our Galaxy. Note that the frequency at which galactic 
supernovae are observed on earth can be quite different from the supernova 
birth rate due to light-travel and extinction effects (Clark et al. 1981). 

3.1.3 Progenitors. According to present knowledge SNell are caused by 
the core collapse of a massive (M ^ 9M©) evolved star. This view is strongly 
supported by observations (in particular of neutrinos) of SN 1987A (Arnett 
et ai. 1989a). SNela are most likely thermonuclear explosions of degener- 
ate white dwarfs. SNelb/Ic are thought to be driven by the collapse of 
the evolved cores of maissive stars, which have lost their hydrogen or even 
part of their helium envelope by either a stellar wind (Wolf-Rayet progen- 
itors with M ^ 30M©) or, if being a member of a close binary, by mass 
transfer (helium star progenitors with 3M© ^ M ^ 5M© and initial masses 
12M© ^ Mi ^ 18M©). Some observational facts and theoretical considera- 
tions which support this general picture are outlined subsequently. A more de- 
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tailed discussion of the progenitor issue can be found in Wheeler & Harkness 
(1990), Branch et aJ. (1991), Woosley & Weaver (1994), Nomoto et al, (1995), 
Nomoto et ad, (1997) and Woosley & Eastman (1997), and in the references 
therein. 

SNela occur in all kind of galaxies and, in particular, also in elliptical 
galaxies (see Sect. 3.1.2), show no preference for spiral arms when they occur 
in spiral galaxies (Maza & van den Bergh 1976), and further show no evidence 
for hydrogen in their spectra (see Sect. 3.1,1). This suggests that their pro- 
genitors are low mass stars and belong to an old population of stars, because 
massive (young) stars (M ^ IOMq) are associated with spiral arms. As single 
stars are unlikely candidates (because it is difficult to understand, how they 
can get rid of their H-envelope and still explode; see e.g., Woosley 1990) the 
favourite model of Type la SNe involves an accreting white dwarf in a binary 
system, which when approaching the Chandrasekhar mass ignites its ther- 
monuclear fuel (He or a mixture of C/0) under degenerate conditions. The 
resulting explosion completely disrupts the star leaving no compact remnant. 
The light curve is powered by the decay of the radioactive ^®Ni, which was 
synthesized during the explosion, into ^®Co and eventually into ^®Fe (Hoyle 
& Fowler 1960; Arnett 1969; Nomoto & Sugimoto 1977). 

SNelb/c occur in spiral arms and close to or in HII regions (Porter & 
Filippenko 1987), which suggests massive progenitors (« 20 - 30Mq, where 
M© is the solar mass) that have been stripped of their hydrogen (SNelb) and 
possibly helium (SNe Ic) envelopes. Massive progenitors are further supported 
by the fact that SNe Ib/c, contrary to SNe la, are strong radio emitters (Weiler 
&: Sramek 1988). This is most easily explained, if the progenitor star lost its 
envelope by a stellar wind: the interaction of the supernova blast wave with 
the wind then can produce the observed radio emission. The small amount 
of 56Ni synthesized in the explosion of a massive star is also consistent with 
the observation that SNe Ib/c are intrinsically dimmer than SNela (Porter & 
Filippenko 1987; see Sect. 3.1.1), Still controversial is the mechanism, which 
leads to the (partial or complete) loss of the H/He-envelope prior to the 
explosion (e.g., Branch et ai. 1991; Leibundgut 1996). The envelope might 
be lost either by a stellar wind, i.e.,the progenitors are Wolf-Rayet stars 
(Wheeler & Levreault 1985; Begelman & Sarazin 1986; Gaskell et al. 1986; 
Schaeffer et aJ. 1987; Woosley et al. 1993; Woosley & Eastman 1997) or, if 
the progenitor is a member of a close binary, due to mass exchange with 
its companion (Ensman & Woosley 1988; Shigeyama et al. 1990; Nomoto 
et al. 1990; Nomoto et al. 1994; Woosley et al. 1994a; Nomoto et al. 1997; 
Woosley Sz Eastman 1997). 

Observational data strongly indicate that SNe II originate from massive 
hydrogen rich Population I stars, because they only occur in the arms of 
spiral galaxies (Tammann 1982; Maza & van den Bergh 1976), show strong 
hydrogen lines and are (at least the Typell-L) radio emitters (Weiler & 
Sramek 1988). Before the discovery of SN 1987A it was generally believed 
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that the progenitors of Type II SNe are red super-giants (e.g., Woosley & 
Weaver 1986) and not blue super-giants as in the case of SN 1987A. Stellar 
evolution calculations show that only (single) stars with a mass M ^ 8M© 
eventually form an iron core and explode (if their H-envelope has not been 
lost during the evolution in which case they may produce a SNIb/c) as a 
SNII (e.g,Woosley & Weaver 1986). 

The progenitors of SNell-L and SNell-P probably only differ in the 
amount of mass of the H-envelope (Chevalier 1984; Woosley & Weaver 1994). 
While the plateau in the light curve of SNell-P, which is caused by the 
recombination of hydrogen, is indicative of a large hydrogen mass (;^ IOMq), 
its absence in SNell-L implies, that their progenitors have lost most but not 
all of their H-envelope (Litvinova & Nadyozhin 1983). In this picture both 
kinds of SNe II are caused by the same explosion mechanism (i.e.,by core 
collapse). A different picture was suggested by Dogget & Branch (1985) on 
the basis of the similarity of the light curves of SNe la and SNell-L. They 
argued that SNe II-L might be caused by a thermonuclear explosion (see also 
Wheeler et aJ. 1987). 



3.2 Physics of Spherical Core Collapse 



Theoretical and numerical models of core collapse SNe (i.e.,SNeII and 
SNelb/c, see Sect. 3.1) are based on the idea, that the implosion of the 
iron core of a massive main sequence star (M ^ 9 Mq) at the end of its ther- 
monuclear evolution causes a supernova explosion with violent mass ejection 
(deject ^ 10"^ km/sec) and the formation of a compact remnant, f.e.,a neu- 
tron star or a black hole (for a review, see e.g., Bethe 1990; Hillebrandt 1994). 
The energy source for the explosion is the gravitational binding energy of the 
forming neutron star, which is of the order 
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where M and R are the mass and the radius of the neutron star, respectively. 
As predicted by theoretical models and as proven through observations of 
SN 1987A, the overwhelming fraction of the binding energy (about 99%) is 
carried away by neutrinos. Roughly 1% of the binding energy is transfered 
into kinetic energy (j^ 10^^ erg) of the ejecta, and only about 10“"^ of the 
liberated energy is emitted in form of electromagnetic radiation 10"^^ erg). 



3.2.1 Conditions at the Onset of Core Collapse. At the onset of col- 
lapse the central density and temperature of the degenerate core of a mas- 
sive star are Qc ^ lO^^gcm"^ and Tc ^ {S ~ 10) x 10^ K, respectively 
(e.g., Woosley & Weaver 1994). The core is mainly made out of iron group 
nuclei (hence iron core) and collapses on a time scale of Tcou ^0.1 sec, its 
dynamical (or free fall) time scale rdyn oc (^ is the average core density) 

being of the order of 1 ms (Fig. 3.4). 




3. Simulations of Core Collapse Supernovae 



361 




M(r)/Mc 



Fig, 3.4. Shortly after the onset of core collapse: The iron core, i.e.,the part of 
the star inside the Si-shell source, has a mass of about the Chandrasekhar mass 
and has begun to collapse homologously. Neutrinos, which are produced by electron 
captures, are freely escaping from the iron core at this stage 



Because of the high densities and temperatures in the iron core strong 
and electromagnetic interactions are in equilibrium. In this (local) nuclear 
statistical equilibrium (NSE) the composition of matter is determined by 
the Saha equation and is, therefore, independent of reaction rates. Further- 
more, the time scale of strong and electromagnetic reactions Trate '^dyn, 
i.e., hydrodynamic processes can be considered as adiabatic changes for these 
reactions. 

In the iron core energy transport by diffusion of photons and electrons 
can be neglected, because the corresponding diffusion length Adiff is much 
smaller than the typical density-scale height {d\n g/dr)~^ . Energy transport 
by heat conduction of degenerate electrons or ions and the viscosity of core 
matter can be neglected, too (Flowers & Itoh 1976). Consequently, except 
for shock waves and energy transport by neutrinos no dissipative process or 
transport phenomenon is of importance for the dynamics of the core, i.e., the 
iron core matter can be treated as an ideal fluid. 

3.2.2 Weak Interactions and Neutrino Transport. For densities g ^ 
3 X lO^^gcm'^ weak interactions are not in equilibrium, because r^eak ^ 
Tcoii- Important weak processes, which have to be considered in core col- 
lapse simulations, are electron captures (EC) on free protons and neutrino 
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absorption on neutrons, neutrino-electron scattering and coherent scattering 
of neutrinos off nuclei (e.g.,Bruenn 1985). Electron captures on nuclei are 
blocked due to shell effects, because in a neutron-rich NSE composition the 
most abundant nuclei possess closed neutron shells (Fuller 1982). 

In principle neutrinos and anti-neutrinos of all flavours, i.e,, 

Ur and Ur can participate in the weak processes. However, due to the strong 
electron degeneracy {rje ^ 20) for g ^ lO^^gcm"^ and thus fairly low tem- 
peratures {kT oc IMeV), the number density of electron neutrinos is much 
larger than the number densities of all other kind of neutrinos, because these 
neutrinos can only be created via pair and plasmon processes (e.g.,Bruenn 
1985). 

Initially neutrinos can leak out of the core, because the density (and hence 
the opacity of core matter) is relatively low (Fig. 3.4). The leakage of neutrinos 
results in a deleptonization of the core, i.e., the electron fraction (per nucleon) 
Yq decreases. The neutrino opacity during collapse is dominated by coherent 
scattering off nuclei mediated by neutral currents (Freedman 1974). The mean 
free path of neutrinos, therefore, is given by (Hillebrandt 1987) 



A. 
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(3.2) 



where {A) and {N) are the average mass number and neutron number of 
the core matter, and where is the neutrino energy. In shocked matter 
the main contributions to the neutrino opacity are due to scattering off free 
nucleons and absorption on neutrons (Bruenn 1985). 

For densities ^^3xl0^^gcm“^ the neutrino diffusion time scale Tdiff > 
Tcoii, which implies that the total lepton number >1 ^ + Ti, is constant 

during further collapse (Mazurek 1974; Sato 1975a, b). This epoch of the 
collapse is called neutrino trapping (see Fig. 3.5). In analogy to photons a 
neutrinosphere can be defined as the surface, where the optical depth is in 
the range 

dr 

2/3 ^ r(r) = j ^ < 1 • (3-3) 

Note that the location of the neutrinosphere depends on the energy of the 
neutrinos, i.e., the neutrino sphere has a variable spatial extent. The typical 
extension is of the order Ar„ « (10 - 30) km (Janka k Hillebrandt 1989a, b). 
Whereas inside and near the neutrinosphere transport processes are of im- 
portance, the neutrinos can freely stream further outside. 

Inside the neutrinosphere the trapped neutrinos thermalize by neutrino- 
electron scattering. The chemical potential of the Fermi distribution of 
the trapped neutrinos grows with increasing density until /3 -equilibrium is 
established at densities ^ ^ 3 x 10^^gcm“^, i.e., the weak reactions come 
into equilibrium and the chemical potentials (including the rest mass) of 
the electrons, protons, neutrons and neutrinos fulfill the relation /^e + A^p = 
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neutrinos are 
"trapped" 



Si-shell source 



Idiff > Icoliapse 

Fig. 3.5. Core collapse after neutrino trapping: The iron core has separated into 
a homologously and subsonically collapsing inner core and a supersonicsdly fcdling 
outer core. The mass of the inner core is about equal to the Chandrasekhar mass at 
this epoch. The Chcuidrasekhar mass has decreased from its initial value, because of 
electron captures (Mch oc At densities larger than about 10^^ gcm“^ neutrinos 
are trapped, i.e., their diffusion time scale is longer than the collapse time scale 



Mn + An important consequence is that the entropy remains constant 
after /^-equilibrium is reached, i.e., the further collapse proceeds adiabatically 
unless shocks are created. 

3.2.3 Equation of State (EOS). For a density g = lO^^gcm”^, a tem- 
perature T = 10^^ K and matter mainly composed of iron one has 

Pe > Plan » ^rad , (3.4) 

where Pe,Pion and Prad are the pressure of the relativistic and degenerate 
electron gas (cx the ion pressure and the radiation pressure, respec- 

tively. This inequality shows that the iron core is stabilized against gravity by 
relativistic electrons. It further implies that the adiabatic index 7 of the mat- 
ter in the iron core at the onset of collapse is close to the critical value 7 crit, 
where the fundamental radial mode of the core becomes unstable (against 
collapse) (Ledoux 1945): 
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A further small but not negligible contribution to the total pressure is the 
(negative!) pressure due to the Coulomb interaction of the ions (e.g.,Hille- 
brandt 1994). 

At “low” densities {g ^ 10^^ gcm~^) free nucleons are rare and nuclei are 
well separated. Therefore, both can be treated as Boltzmann particles. The 
EOS is then determined, provided the ground state binding energies and the 
nuclear partition functions are known (e.g.,El Eid k Hillebrandt 1980). At 
“high” but sub-nuclear densities (10^^ gcm“^ ^ ^ ^ 2 x lO^^gcm"^) the free 
nucleon density increases and starts to influence the properties of the nuclei. 
The nuclei become more closely packed and begin to interact. In this “high” 
density regime a microscopic model is required to compute the EOS, which 
incorporates nucleon-nucleon and nucleus-nucleus interactions, eflFects due to 
the flnite size of the nuclei and lattice effects due to Coulomb correlations (for 
a review see, e.g., Hillebrandt 1990; Vautherin 1994). At nuclear saturation 
density a phase transition occurs and nuclei and free nucleons merge into 
homogeneous, asymmetric (Ve 0.3) nuclear matter (Lattimer et al. 1985). 
The EOS gets very complicated, as many-body forces, relativistic effects, and 
extra hadronic and mesonic degrees of freedom must be taken into account 
(e.g., Vautherin 1994). 

Published EOSs show that the adiabatic index lies in the range 1.27 ^ 
7 ^ 1.325 for densities ^ ^ 2 x 10^"^gcm“^ and entropies 5 « Ike /nucleon 
(El Eid k Hillebrandt 1980; Hillebrandt k Wolff 1985; Lattimer et al 1985), 
where A:b is Boltzmann’s constant. If the density exceeds nuclear saturation 
density the EOS stiffens considerably due to the repulsive part of the nuclear 
forces (7 > 2.5). 

For a qualitative discussion of the influence of the EOS on the collapse 
dynamics it has been proven useful to approximate the EOS by a simple 
polytropic law P ~ K{Ye, S)g'^ ^ where the electron fraction the entropy 
S and the adiabatic index are in general density-dependent (van Riper 1978; 
Tohline 1984; Cooperstein k Baron 1990; Janka et al 1993). 



3.2.4 Collapse and Bounce. According to Chandrasekhar (1939) a star, 
which is supported against gravity by the (zero temperature) pressure of its 
relativistic electrons, can only be stabilized, if its mass is less than a critical 
mass 



Mch = 1.457 (2ye)''Mo , (3.6) 

which is the famous Chandrasekhar mass. At finite temperature the critical 
mass of a stable iron core should increase, but this increase is almost per- 
fectly canceled by a decrease of the critical core mass due to Coulomb lattice 
correlations (Baron k Cooperstein 1990). 

At the end of Si-core burning the electron fraction in the iron core of a 
massive star has a value 0.42 ^ Vg ^ 0.44, because of electron captures which 
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occurred during oxygen and (core) silicon burning. The central entropy of 
the core is of the order 1 ke /nucleon (Nomoto & Hashimoto 1988; Woosley & 
Weaver 1986, Woosley & Weaver 1988). The mass of the iron core eventually 
exceeds the Chandrasekhar mass, because (i) the mass of the core grows as 
a result of Si-shell burning, (ii) electron captures further reduce the lepton 
number in the core as neutrinos are lost from the star, and (iii) the pressure 
in the core is reduced due to (endothermic) photo-disintegration of nuclei by 
energetic photons (Fig 3.4). The relative importance of the last two processes 
depends on the stellar mass. For very massive stars (M ^ 20 Mq) photo- 
disintegrations are more important than EC’s, but the opposite is true for 
less massive stars, because of their lower central entropies 

The crucial quantity for the dynamical behavior of the core is the “effec- 
tive” adiabatic index (van Riper & Lattimer 1982) 
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which describes the change of the pressure along a collapse trajectory of a 
given mass element. If the EOS is approximated by a polytropic form, the 
“effective” adiabatic index before neutrino trapping (Fj = Fe) can be written 
for an adiabatic collapse {SS = 0 ) as 



4 1 SYe 
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(3.8) 



i.e.,r <7 if 5Yq < 0. This demonstrates that the density dependent electron 
captures reduce the adiabatic index, and therefore destabilize the core. 

The initial entropy of the core is small (5 ^ Iks /nucleon) and remains 
small during collapse {AS ^ 0.5), a crucial fact first noticed by Bethe et 
al. (1979). This near adiabaticity of the collapse has important consequences. 
Firstly most of the nucleons remain bound in nuclei, because the entropy of a 
nucleus of mass number A is much less (about a factor A) than the entropy of 
A free nucleons. Therefore, the pressure is dominated by relativistic electrons 
(and neutrinos after trapping) until nuclear matter density is reached, i.e., 7 ^ 
4/3. This in turn implies that the collapse of (non-rotating) iron cores cannot 
be stopped at densities less than nuclear matter density. 

Analytical calculations (Goldreich & Weber 1980; Yahil & Lattimer 1982; 
Yahil 1983) and numerical simulations show that the dynamics of core col- 
lapse can be understood qualitatively by means of self-similar solutions, 
i.e., solutions which have the same structure at all times. According to Gol- 
dreich & Weber (1980) there exists an exact stable homologous (i.e.,i;(r) oc r) 
solution for marginally stable poly tropes with P — kqq'^ and 7 — 4/3. Ob- 
viously, the size of the homologous region cannot extend out to arbitrarily 
large radii, because the infall velocity cannot exceed the local free-fall veloc- 
ity. Applying the Goldreich Sz Weber solution to realistic iron cores shows 
that the mass of the homologous part of the core shrinks (in mass) during 
collapse, because Kq oc Ye^^y which decreases through electron captures. 
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Fig. 3.6. Schematic profiles of the infall velocity versus radius at time t\ and t 2 > ti 
showing the decrease (in radius and mass) of both the homologous core and the 
inner core (IC) due to electron captures occurring during core collapse. The edge 
of the homologous core coincides with the position of the sonic point, and the edge 
of the IC is defined by the minimum of the infall velocity. The dashed lines give 
the (negative of the) local sound speed 



Yahil & Lattimer (1982) and Yahil (1983) have extended the work of 
Goldreich & Weber (1980) to poly tropes, where 7 = T is constant in space 
and time and 6/5 < 7 < 4/3. They found self-similar solutions for the whole 
core, which contain the Goldreich & Weber solution in the limit 7 = 4/3, 
but which are not exactly homologous for smaller values of the adiabatic 
index. In the Yahil &: Lattimer solutions the core splits into a subsonically 
and approximately homologously collapsing inner part and into an outer core, 
which is falling supersonically. 

Near the edge of the homologous core exists a sonic point, where the 
infall velocity exceeds the local sound speed (Fig. 3.6). Somewhat further out 
in radius, at r = (absolute value of the) infall velocity reaches a 

maximum. The mass inside the (time-dependent) radius is called the 

mass of the inner core (henceforth IC). According to numerical simulations 
the mass of the inner core lies in the range 

0.6 Mo ^ Mic ^ 0.9 Mo . (3.9) 

When the central density of the core exceeds nuclear matter density the 
EOS stiffens, i.e.,7 increases from a value somewhat below 4/3 to a value 
2.5— 3.0. According to the virial theorem the core can reach a new equilibrium 
position, if the (pressure weighted) average 7 > 4/3 in the core. Starting at 
the center of the core successive mass shells are stopped. Pressure waves 
move outwards in radius and accumulate within a millisecond near the sonic 
point, where they steepen into a shock wave (Fig. 3.7). Due to its inertia the 
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Fig. 3.7. Schematic velocity profiles versus radius at the point of “last good ho- 
mology” , at bounce and at the time, when the shock wave has just detached from 
the inner core 



homologous core overshoots its equilibrium position and rebounds behind 
the shock wave, which has already started propagating outward. The shock 
detaches from the rebounding core at r = at the outer edge of the 

IC (Fig. 3.7). 

This so-called bounce of the core occurs on the time scale of a radial 
pulsation, which is of the order 1 millisecond. For a stiff nuclear EOS the 
maximum density reached during bounce is less than two times nuclear sat- 
uration density, whereas for a soft nuclear EOS the collapse can continue up 
to 3 - 4 times, and for a very soft nuclear EOS (with 7 = 2) even up to 10 
times nuclear saturation density, if effects of general relativity are included 
in the calculation (Baron et aJ. 1985a, b). 

3.2.5 Prompt Explosions and Shock Propagation. The initial energy 
of the shock wave is approximately equal to the kinetic energy of the IC at 
the moment t = of last good homology (Brown et ai. 1982), i.e,, 

£^£ck«^kfn(< = 4). (3.10) 

During bounce the kinetic energy of the inner core is quickly transferred to 
the shock wave, which is an efficient absorber of sound waves. Note that 
the kinetic energy of matter between the sonic point (i.e., the edge of the 
homologous core) and (i.e., the edge of the IC) is dissipated before the 
shock detaches from the IC and, therefore, has to be included into the initial 
energy of the shock wave (Fig. 3.7). The moment of last good homology is 
defined as the time, when the center of the core begins to be decelerated by 
the stiffening nuclear EOS. 

Numerical simulations and analytical estimates (Monchmeyer 1993) show 
that the initial energy of the shock wave is of the order of 
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Fig. 3.8. Core collapse after shock formation: Due to the stiffness of nuclear matter 
the collapse has been stopped and a shock wave has formed at the edge of the inner 
core, which has approached approximate hydrostatic equilibrium. The central part 
of the inner core consists of homogeneous nuclear matter, while matter in its outer 
layers is made up predominantly of heavy neutron-rich nuclei. Matter from the outer 
core, which falls through the outward propagating shock wave is photo-disintegrated 
into free nucleons. This is a strongly endothermic process, which weakens the shock 
and eventually turns it into a standing accretion shock 

^shlck (4 - 10) X 10'' erg , (3.11) 

much more than is required to explain the observed (kinetic and electromag- 
netic) energy output of a supernova. However, while propagating outward the 
shock wave is severely damped by photo-disintegration of heavy nuclei into 
free nucleons (if S ^ 5 ke /nucleon) or alpha particles (if 3kB/nucleon ^ S ^ 
5 ke /nucleon), the energy loss being of the order (Hillebrandt, W. & Muller 
1981; Mazurek 1982; Fig. 3.8) 

Sioss = (1.6 - 1.8) X 10^1 erg / O.IMq . (3.12) 

After the shock wave has crossed the neutrino sphere, it is further weakened 
by neutrino losses. 

The shock wave also gains energy from the dissipated kinetic energy of 
matter passing through it. However, for spherically symmetric models it can 
be shown (Monchmeyer 1993) that the kinetic energy is not sufficient to 
guarantee an expansion of the shock heated matter if 7 1.4. Even in the 
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layers, where nuclei are completely disintegrated (7 ^ 1.5), most of the dis- 
sipated kinetic infall energy is needed to bring this matter into hydrostatic 
equilibrium while it is added to the quasi-static proto-neutron star. When 
most of the initial shock energy is lost due to photo-disintegration, the shock 
is no longer strong enough to lead to complete distintegration of the nuclei 
in the post-shock region and the value of the post-shock 7 drops below 1.4. 
The shock must turn into an accretion shock, if no further energy source is 
available. Such an additional source of energy is the heating of the shocked 
matter immediately behind the shock by energetic neutrinos diffusing out of 
the opaque central regions of the collapsed core (see below). 

Numerical simulations had to be performed to clarify, whether the energy 
losses of the shock are so severe that it stalls and turns into an accretion 
shock, or whether the shock wave is able to reach the outer layers of the star 
and cause a supernova explosion. As the (crucial) energy loss due to photo- 
disintegration is proportional to the difference between the mass of the iron 
core and that of the IC at bounce (Fig. 3.8), it is obvious that the best 
candidates for Type II SNe are stars with small iron cores (M ^12 Mq) and 
low initial entropy, the latter resulting in a large mass of the IC (Hillebrandt 
1987). In particular, the energy loss given in Eq. (3.12) implies a limiting 
mass, which can be photo-disintegrated by the shock, of the order 

Mcore-Mic ^ 0.45 Mo . (3.13) 

For an IC with a mass in the range 0.8 Mq ^ Mic ^ 0.9 Mq as found in 
numerical simulations a so-called prompt explosion can only occur, if Mcore < 
1.35 M©. 

Among researchers in the field there is now general agreement that when 
the detailed effects of neutrino transport are taken into account currently 
available initial models and in particular models with iron cores more massive 
than about 1.35 Mq cannot explode by the prompt mechanism, no matter 
what the right equation of state is (for more details, see e.g., Muller 1990). 

3.2.6 Delayed Explosions. A more viable Type II supernova explosion 
mechanism which relies on neutrino heating was found by Wilson (1985) 
(Bethe & Wilson 1985; Wilson et ai. 1986; Colgate 1989; Janka 1993). In 
this so-called delayed explosion mechanism energy is deposited in the layers 
between the nascent neutron star and the stalled prompt supernova shock 
during a period of a few 100 ms by absorption of a small fraction (between 
1% and 10%) of the neutrinos emitted from the collapsed core (Fig. 3.9). 
The neutrino energy deposition increases the pressure behind the shock front 
and the respective layers of the star begin to expand. Because of this ex- 
pansion an extended region of low density, but quite high temperature is 
created above the surface of the proto-neutron star. The persistent energy 
input by neutrinos maintains high pressure in the radiation dominated “hot 
bubble” region, thus driving the shock outward, which eventually causes the 
supernova explosion. 
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R = 80-1 00 km Rv = 50-70 km 
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Fig. 3.9. Sketch of the proto-neutron star and the “hot bubble” region several ten 
milliseconds after core bounce (see text for details) 



During the phase of matter accretion, lasting for up to more than hun- 
dred milliseconds after core bounce, material is falling through the standing 
accretion shock. Initially cooling processes dominate in this material, which 
radiates away its gravitational binding energy by neutrinos, and settles onto 
the proto-neutron star. As the neutron star becomes more and more compact 
and the accretion rate shrinks with the decreasing density of the infalling lay- 
ers of the progenitor star, there is a moment, when net heating by neutrino 
energy deposition must win against neutrino cooling. 

To see this, let us compare the most important neutrino processes in 
the hot, dissociated post shock material during the accretion phase, namely 
electron/positron absorption on protons/neutrons and the inverse reactions: 

e“ -h p < — > i/g + n , -\r n i — > + p . (3-14) 

The cooling rates per baryon due to the emission of and Pe? respectively, 
in these capture reactions can approximately be written as (see Janka 1993 
for a discussion and justification of the involved approximations) 
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Here, T denotes the gas temperature, Yn and Yp are the mean concentrations 
(i.e., numbers per baryon) of free neutrons and protons, respectively. L 52 is 
the i/q- or Pe“liiniinosity in 10 ^^ erg/s, ry gives the distance from the compact 
neutrino source in 10 ^ cm, and (e^) defines the average squared energy of the 
neutrinos. 



In the thin, hydrostatic, isothermal atmosphere of a neutron star with 
the pressure being dominated by a Boltzmann gas of baryons, the density 
declines exponentially with height. For the typical temperatures T MeV 
this implies that at sufficiently small densities relativistic electrons, positrons, 
and photons will start to yield the main contribution to the pressure, i.e., P oc 
T^. Hence, the matter in the gravitational field of the neutron star must 
be stabilized by a temperature gradient instead of a density gradient. In 
case of an adiabatic atmosphere, i.e.,^ oc T^, this requires a temperature 
decline T oc . However, for neutrino heating being balanced by neutrino 
cooling, — Q", Eqs. (3.15) and (3.16) imply T oc Hence, net energy 
deposition in a region close to the newly formed neutron star is unavoidable 
in the described situation. 



Simulations showed that neutrino heating indeed works as predicted by 
the analytical considerations discussed above (Wilson 1985; Bethe & Wilson 
1985; Wilson et aJ. 1986; Wilson & Mayle 1988; Colgate 1989; Bruenn 1993; 
Janka 1993). However, as pointed out by Janka (1993; see also Wilson Sz 
Mayle 1988), who performed one-dimensional simulations of the post bounce 
evolution until more than 10 seconds after bounce, observed explosion ener- 
gies cannot be obtained through energy deposition by the slowly decaying 
neutrino emission during the Kelvin-Helmholtz cooling of the proto-neutron 
star. Instead, he found that energetic explosions require the occurrence of 
an early phase of high neutrino luminosities (e.g., boosted by convection; see 
next section) with efficient heating in the vicinity of the neutron star. The 
main reason for this is that a sufficiently large mass must be heated and 
must store the energy deposited by neutrinos. Since processes like neutrino- 
electron scattering and i/i?-annihilation are much less efficient in transferring 
neutrino energy to the stellar matter, the supernova explosion does not gain 
much energy any more at the late epochs when the densities around the 
proto-neutron star are already rather low. 



3.3 Observations Demanding Nonspherical Models 

Various independent observational evidence exists that substantial non-radial 
motion and mixing has occurred during the explosion of SN 1987A (for a 
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review see, e.g., Arnett et ai. 1989a). This evidence includes the early de- 
tection of X-rays (e.g.,Dotani et a]. 1987; Sunyaev et ai. 1987; Wilson et 
ai. 1988; Pendleton et ai. 1995) and 7 -rays (Matz et ai. 1988; Mahoney et 
ai. 1988; Sandie et ai. 1988; Cook et ai. 1988; Gehrels et al. 1988; Teegarden 
et ai. 1989), both of which can most easily be explained if radioactive ^®Co 
had been mixed from the interior regions of the star into the envelope where 
the optical depth was much smaller. Assuming that the radioactive ^®Co is 
homogeneously mixed over a certain interior region of the star Arnett (1988) 
found that the radius of that region has to be of the order of 20 to 30% of 
the stellar radius in order to explain the time behaviour of the observed X- 
and 7 -ray flux from SN 1987A. 

This interpretation is supported by observations of the line widths of 
infrared emission lines of Fell (Erickson et ai. 1988; Haas et al. 1990; Colgan 
et ai. 1994) and of Nill, Aril, and Coll (Rank et aJ. 1988; Barthelmy et 
ai. 1989; Witteborn et ai. 1989; Colgan et ai. 1994). According to Haas et 
ai. (1990) and Colgan et ai. (1994) the inferred expansion velocity of the bulk 
of the Fell is about 2000 km/ s, and up to 4000 km/s for a significant fraction 
(8 to 30%) of the iron mass. However, this measured bulk velocity of Fell is 
significantly higher than the « 800 km/s predicted by spherically symmetric 
theoretical models at this epoch for those stellar layers, which should contain 
the iron and nickel synthesized during the explosion. Moreover, these layers 
were still inside the photosphere at the time of the infrared observations. 

Analyzing the light curves of several Fe, Co and Ni infrared lines Li et 
ah (1993) further found that the observations can be best explained, if these 
elements were mixed outward macroscopically and inhomogeneously in the 
form of about 60 to 100 identical clumps. Evidence for mixing and clump- 
ing also comes from asymmetries of the spectral line profiles (Spyromilio & 
Meikle 1990; Stathakis et al. 1991; Spyromilio & Pinto 1991; Spyromilio et 
al. 1993; Spyromilio 1994). A similar analysis showed evidence for clumping 
in SN 1987F (Filippenko & Sargent 1989) and SN 1988A (Spyromilio 1991). 

Another observational hint stems from the sudden appearance of fine- 
structure in the Balmer lines H^-H^j between day 20 and day 30 (so-called 
Bochum event; Hanuschik &: Thimm 1990; Hanuschik et al. 1993; see also 
Chugai 1991), which most likely is due to a local change in the excitation and 
ionization conditions, probably caused by radioactive decay of mixed-up ^^Co 
in the layers just above the photosphere. Similar observations in SN 1985L 
(Filippenko & Sargent 1986) and SN 1985P (Chalabaev & Cristiani 1991) 
indicate that such a kind of event may be generic to type II supernovae. 

The smoothness of the light curve is also indirect evidence of the need for 
mixing to occur (Arnett 1988; Woosley 1988; Shigeyama et al. 1988). Mixing 
of hydrogen into the center of the star allows for a time spread in the libera- 
tion of the recombination energy and thus helps to model the rather smooth 
and broad maximum observed in the light curve of SN 1987A (Shigeyama & 
Nomoto 1990). On the other side does mixing of heavies into the hydrogen 
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Fig. 3.10. Evolution of 
the density distribution 
of a 15 Mo star caused 
by the propagation of 
the shock wave (launched 
during 

core collapse) through 
the stellar envelope. The 
rapid drop of the density 
near composition inter- 
faces gives rise to a non- 
steady shock propagation 
which in turn causes den- 
sity inversions 



rich envelope lead to a homogenizing of the opacity, which again smoothes 
the light curve (e.g., Arnett et aJ. 1989a; and references therein). 

Finally, strong observational support for the generic nature of mixing 
processes in supernova envelopes (of SNell and SNelb/c) comes from the 
bright SNIIb supernova SN 1993J in the nearby (3,6 Mpc) galaxy M81 (for 
a review on SN 1993J see, e.g., Wheeler & Filippenko 1996). Mixing of the 
ejecta was inferred from observed asymmetric profiles of the [0 I] and [Mg I] 
emission lines at late times (Spyromilio 1994; Wang & Hu 1994) and from 
light curve modeling (Shigeyama et al. 1994; Woosley et aJ. 1994b). 

Theoretically, there exists no obvious reason that instabilities, mixing and 
clumping in supernova envelopes and ejecta are specific to SN 1987A. These 
processes will occur generically in core collapse supernovae, because the shock 
wave launched during core collapse will always encounter composition inter- 
faces (i.e., rapid drops of density) on its way out through the stellar envelope 
(Fig. 3.10). Hence, at these interfaces the acceleration of the shock, caused by 
its propagation down the density slope, reaches a local maximum. The rel- 
ative deceleration of the shock wave past the composition interface causes a 
reverse shock and eventually an unstable density and pressure stratification. 
Consequently, instabilities and mixing are unavoidable. Only the strength 
and the number of instabilities (composition interfaces) will vary from star 
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to star. Actually, observation of mixing, inferred from light curve and spectra, 
can provide interesting diagnostics of the internal structure of the progenitor 
(e.g.,Iwamoto et al, 1997). 

3.4 Rayleigh— Taylor Instabilities in Supernova Envelopes 

As stated by Chandrasekhar (1961): The Rayleigh- Taylor instability derives 
from the character of the equilibrium of an incompressible heavy fluid of vari- 
able density. An important special case in this connexion is that of two fluids 
of different densities superposed one over the other (or accelerated towards 
each other). In order to gain some insight in the Rayleigh-Taylor (henceforth 
RT) instability I will discuss two special cases in some detail. 



A 




Fig. 3.11. Two uniform fluids of 
constant density and Q 2 sepa- 
rated by a horizontal boundary at 
z — 0. The fluids are subject to 
an acceleration g {e.g., due to grav- 
ity) acting in negative direction. If 
Q 2 > Qi the arrangement of fluids is 
Rayleigh-Taylor unstable 



Following Chandrasekhar (1961) let us consider the static case of two uni- 
form fluids of constant density qi and p 2 separated by a horizontal boundary 
at z = 0 (Fig. 3.11). The fluids are subject to an acceleration g (e.g.,due 
to gravity) acting in negative z-direction, and the interface between the flu- 
ids is subject to forces arising from surface tension T. Suppose further that 
this system is slightly perturbed. Then by a normal mode analysis of the 
disturbance, i.e., seeking solutions of the form exp{ikxX + ikyp + nt), where 
kx^ky and n are constants, the following dispersion relation can be derived 
(Chandrasekhar 1961) 
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(3.17) 



Here the growth rate r = and the absolute value of the wave vector of 
the perturbation k = {k^ + According to Eq. (3.17) the arrangement 

of fluids is stable^ if Q 2 < Qi, i.e., when the lighter fluid is on top of the heavier 
fluid, because < 0 in that case. Otherwise, if Q 2 > Qu the arrangement 
of fluids is unstable for all wave numbers in the range 0 < A: < where 
kc = [(^2 - Q\)glTY^'^, but is stable for wavenumbers k > k^y because 
surface tension succeeds in stabilizing short wavelength perturbations. 
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Let us consider a second simple case, namely that of a fluid confined 
between two rigid planes at 2 ; = 0 and z = d which has an exponentially 
varying density g = Qo exp{Pz) with a typical (constant) density-scale height 
/3“^. The corresponding dispersion relation is (Chandrasekhar 1961) 



r-^=90 



k^cP 



(3.18) 



where m is an integer > 1. Obviously the stratification is stable if f3 is neg- 
ative^ while it is unstable if /3 is positive. Note that the modes with shortest 
growth time have m = 1, and that if d^k and m are given r is smallest, if 

Since the energy given to the shock wave in a supernova explosion is much 
larger than the binding energy of the mass outside the collapsed core, gravity 
is dynamically unimportant for the propagation of the shock wave through 
the stellar envelope. Thus, as first pointed out by Chevalier, the only way to 
drive a RT instability is by pressure and density gradients of opposite signs, 
the “effective acceleration” being the (negative) pressure gradient, i.e., 

1 dp 

^ Q dr 

In order to be of any consequence for the supernova explosion the RT 
instability growth time, trx> obviously must be shorter than the hydrody- 
ngtmic time scale, Thydro = 'f'sh/^sh, where Vsh suid Ush are the radius and 
the velocity of the shock wave. Substituting g in Eq. (3.18) according to rela- 
tion (3.19) Chevalier (1976) derived the following RT instability criterion for 
power-law density distributions: 

KS) -KS)'(S)) 

where Hi = | (9 In P/5r) and i ?2 = |(51ne/3r) are the (absolute) 

pressure and density-scale heights, and where Cg is the local sound speed. 



— > 1 , (3.20) 
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3.5 Simulations of RT Instabilities in Supernova Envelopes 

Soon after the first observational evidence was obtained that instabilities and 
mixing had occurred in SN 1987A (see Sect. 3.3), several groups began to 
perform multi-dimensional simulations using polytropic and realistic stellar 
models (Arnett et al. 1989b; Muller et aJ. 1989; Benz & Thielemann 1990; 
Den et al. 1990; Hachisu et aJ. 1990; Yamada et al. 1990; Pryxell et al. 1991; 
Hachisu et aJ. 1992; Herant & Benz 1991; Muller et al. 1991a, b,c). In all these 
simulations the explosion was started artificially by liberating the typical 
type II supernova (kinetic) energy of about 10^^ erg in the innermost zones 
of the stellar model. Thus, it was assumed (because of numerical reasons 
and because of our ignorance concerning the explosion mechanism) that the 
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propagation of the shock wave through the stellar envelope can be considered 
without a detailed treatment of the explosion mechanism. We shall see below 
that this assumption has to be abandoned. In order to trigger the instability 
the initially spherically symmetric stellar models had to be perturbed, the 
amplitudes of the perturbations ranging from 0.1 to 10%. 

The hydrodynamic simulations showed in particular that Rayleigh-Taylor 
(RT) instabilities occur in realistic stellar models near composition interfaces 
and that the instabilities reach the nonlinear stage characterized by narrow 
finger-like structures which are topped by “mushroom caps”. About 5hrs 
after the onset of the explosion the He-shell is shredded into clumpy fragments 
the density contrast exceeding a value of five (Fig. 3.12; for more details see, 
e.g., Muller et ai. 1991b, c). Three-dimensional calculations confirm the results 
obtained in two-dimensions (Muller et ai. 1991a), and also show pronounced 
instabilities producing clumpy structures, which have a density up to a factor 
of ten higher than the surrounding matter. 

Furthermore, Herant & Benz (1991, 1992) showed that the mixing in- 
duced by RT instabilities occurring during the first few hours is substantially 
modified at later times by the radioactive decay of ^^Ni and ^^Co, which 
increases the peak velocity of the nickel and of its decay product iron by 
approximately 30%. These “boosted” iron velocities are consistent with the 
velocities deduced from infra-red iron lines for the bulk of the observed Fe 
(Haas et ai. 1990). However, they fail to explain the high velocity wings of the 
lines, which extend to velocities ^ 3000 km/s, and which, according to Haas 
et ai. (1990), represent between 8 and 30% of the iron. The 17.94 and 25.99 
/xm Fell line profiles observed by Haas et ai. (1990) and Colgan et ai. (1994) 
also contain an unresolved 3 — 5cr emission feature, which can be interpreted 
as a high velocity clump of material containing 3 % of the total iron mass 
and moving with a velocity of about 3900 km/s. 

The results of the hydrodynamic simulations strongly indicate that the 
mixing induced by the RT instabilities in the envelope is insufficient to ex- 
plain the observations of SN 1987A and, in particular, the early discovery of 
7 -rays, the large velocities of iron, and the clumpy structure of the ejecta. 
This quantitative failure of the simulations may be taken as evidence that 
non-radial motion must have occurred in type-II supernovae already dur- 
ing the very early phases of the explosions, e.g., induced by the convectively 
unstable hot bubble created by neutrino heating in the “delayed explosion” 
scenario (see Sects. 3.2.6 and 3.6). Concerning the simulations this also im- 
plies that the formation of the shock wave and its propagation cannot be 
studied independently, as it has been done in all investigations up to now. 

Finally, multi-dimensional hydrodynamic simulations have shown that RT 
instabilities do not only occur in SNell explosions of blue super-giants as 
SN 1987A, but do also occur in SNellP explosions of red super-giants (Herant 
& Woosley 1994; Shigeyama et aJ. 1996) and in helium star models of SNelb 
(Hachisu et aJ. 1991, 1994; Iwamoto et ai. 1997). 
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Fig. 3.12. Helium mass fraction computed on a moving grid of 800 x 400 zones in 
spherical coordinates (r, 6) for an explosion energy of 10^^ erg and an initial random 
perturbation of 10 % amplitude about 4 hrs after the onset of the explosion. The plot 
shows only a part of the (moving) computational grid up to a radius of 2.7 x 10^^ cm. 
The helium shell (bright grey) has been shredded by the RT instabilities. (From 
Miiiler et a/. 1991b) 



3.6 Neutrino Driven Convective Instabilities 



The criterion for the occurrence of convection in matter with density p, en- 
tropy per baryon S and electron number fraction is given by (Epstein 
1979) 
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where r is the radial coordinate. Hence, a sufficient condition for convective 
motion is given, when both the gradients of entropy and electron number 
fraction are negative, j.e.,d5/dr < 0 and dFe/dr < 0. 

3.6.1 Convection Inside the Proto-Neutron Star. After the shock 
wave stalls 10 ms after bounce) and before the delayed neutrino heat- 
ing mechanism begins to become effective 100 ms after bounce) the outer 
parts of the proto-neutron star are convectively unstable because of two rea- 
sons. Firstly, the deleptonization occurring in the shocked matter outside the 
neutrino-sphere produces a negative lepton gradient. Secondly, the weakening 
(and the eventual stalling) of the prompt shock wave gives rise to a negative 
entropy gradient in the same region. According to Eq. 3.21 this is a suffi- 
cient condition for convective motion to occur. This situation is commonly 
encountered in supernova simulations (Burrows 1987; Burrows & Lattimer 
1988; Bruenn 1993). 

The existence of convective instabilities inside the proto-neutron star has 
been demonstrated in two-dimensional (Burrows & Fryxell 1992, 1993; Janka 
& Muller 1993, 1996a; Mezzacappa et al. 1998a) and three-dimensional hy- 
drodynamic simulations (Muller 1993; Muller & Janka 1994, 1997), which 
covered the onset and growth of the convective instability until « 50 ms af- 
ter core bounce. In all simulations an inner boundary condition at a fixed 
finite radius was used in order to save a significant amount of computer time. 
Hence, the flow was not simulated in the innermost part of the proto-neutron 
stax- 

The evolution of the convective instabilities in the mantle of proto-neutron 
stars is illustrated in Fig. 3.13, which is taken from Muller & Janka (1997) 
and which shows the time evolution of the electron number fraction in a 
region between 15 km (the radius of the fixed inner boundary) and 155 km. 
The shocked and neutronized material starts to develop finger-like structures 
on growth time scales of roughly 10 ms penetrating inward (right upper panel 
in Fig. 3.13). As the convective velocities reach or even exceed the local sound 
speed, significant undershooting and overshooting occurs at the inner and 
outer boundaries of the unstable region, respectively. The initially narrow 
fingers begin to merge into successively larger blobs. After another 5-10 ms 
the whole unstable region is involved in a convective overturn (left lower panel 
in Fig. 3.13). Finally, after about 20 ms the inner region is completely mixed 
and homogenized. 

The extent of overshooting depends on whether the flow is simulated in 
two or three space dimensions. A mode analysis performed by Muller & Janka 
(1997) reveals that in 2D the convective structures are roughly a factor of two 
larger than in 3D, where the dominant modes have (angular) wavelengths of 
about 20^ — 30^ (i.e., clump sizes of about 10® — 15®). Consequently, the 2D 
structures have more inertia and give rise to a larger amount of overshooting 
(50% more in 2D than in 3D). 
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Fig. 3.13. Convection inside the nascent neutron star. The panels show the time 
evolution of the electron number fraction Ye in a region between 15 km and 155 km. 
The panels are arranged in counter-clockwise order, starting from the right upper 
side, and show snapshots at 12 ms, 14 ms, 18 ms, and 21 ms after the start of the 2D 
simulation. The color levels correspond to Te- values in the range [0.1, 0.5] increasing 
from blue, over green, yellow and orange to red. Blue regions have Ye < 0.12, yellow 
regions correspond to 0.13 ^ Ye ^ 0.16, and orange regions to 0.16 ^ Ye ^ 0.21. 
(Prom Muller &: Janka 1997) 



As the convective velocities reach and even partially exceed the local 
sound speed (uconv ^ 10® cm/s) strong pressure waves and even weak shock 
waves are generated by the convective flow. This convective activity inside the 
proto-neutron star causes significant inhomogeneities in the temperature and 
density stratifications. The inhomogeneities vary in time and space implying 
non-radial mass motions and a time-dependent mass quadrupole moment as- 
sociated with the outer layers of the proto-neutron star. Both effects generate 
gravitational radiation with typical frequencies in the range of several hun- 
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dred Hz to about one kHz (see Sect. 3.8). An additional gravitational wave 
signal is produced by the anisotropic neutrino emission found in the multi- 
dimensional models (Epstein 1978; Burrows & Hayes 1996; Muller k Janka 
1997). 

The convective mixing found in the multi-dimensional simulations around 
and below the neutrino-spheric region near the proto-neutron star surface 
is accompanied by an increase of the neutrino luminosities during the early 
phase of the supernova explosion. Since the convection is dynamic and violent 
and the gas velocities are close to the local speed of sound, neutrinos are 
transported out of the dense interior of the newly-formed neutron star much 
faster than by diffusion. 

Recently, the first two-dimensional hydrodynamic simulations of the 
deleptonization of a newly formed neutron star were performed by Keil 
(1997); see also Keil e^; a/. (1996). These simulations cover about 1.2s of the 
proto-neutron star’s evolution. They were performed in spherical coordinates 
using a grid of 100 (moving) radial zones and 60 angular zones (1.5 degree an- 
gular resolution). Neutrino transport was computed in diffusion approxima- 
tion in radial direction for every angular zone, i.e., angular transport of neutri- 
nos was neglected. Keil (1997) found that driven by negative lepton-fraction 
and entropy gradients, convection starts near the neutrinosphere about 20- 
30 ms after core bounce. This is in agreement with previous simulations (see 
above), which however ended at this stage. As Keil (1997) pushed his simu- 
lations further in time, he found that convection continues and moves deeper 
into the protoneutron star. After about one second the whole protoneutron 
star is convective. Keil’s (1997) simulation further show that deleptonization 
of the proto-neutron star proccedcs much faster, when convective transport 
is included. Compared to a ID model, the lepton flux and the neutrino lu- 
minosities are increased by up to a factor of two in his 2D simulation. The 
convection below the neutrinosphere also raises the neutrinospheric temper- 
atures and mean energies of the emitted neutrinos by 10-20%. These effects 
can have important implications for the supernova explosion mechanism and 
will change the detectable neutrino signal from the Kelvin Helmholtz cooling 
of the proto- neutron star. 

3.6.2 Convection in the Hot Bubble Region. Noting that the hot bub- 
ble is convectively unstable Colgate (Colgate 1989; Colgate et al. 1993) and 
Bethe (1990) pointed out that this situation may give rise to a dynamical 
overturn of hot, neutrino-heated, rising material and cold post-shock matter, 
and can lead to large-scale deviations from spherically symmetric supernova 
explosions. This instability was indeed found in numerical simulations (Bur- 
rows et al 1995; Herant et al. 1992, 1994; Janka k Muller 1995a, b, 1996a, b; 
Mczzacappa et al 1998b; Miller et al. 1993; Muller k Janka 1994; Shimizu et 
al 1993; 1994; Yamada et ai. 1993). 

In order to describe the formation of the convective instability and to 
simulate the convective motion in the “hot bubble” region an adequate treat- 
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ment of neutrino-matter interaction is necessary and the generation and 
transport of neutrinos must be modeled. This can only be done in some 
approximate manner, because a multi-dimensional, multi-flavor and energy- 
dependent treatment of neutrino transport is presently not feasible. More- 
over, the neutrino luminosities from the collapsed stellar core are dependent 
on convective processes below and around the “neutrino sphere” (see previous 
subsection) and cannot be accurately determined without an elaborate treat- 
ment of neutrino transfer in this turbulent region. Possibly even methods to 
simulate neutrino transport in more than one-dimension have to be applied. 
Finally, the neutrino emission depends on the uncertain nuclear equation of 
state and on the neutrino interaction rates in the high-density material, which 
are only known with an accuracy of, may be, a factor of two. 

Because of this lack of knowledge, Janka & Miiller (1996b) have performed 
a sensitivity study instead of calculating a consistent but specific model. Cer- 
tainly, consistency of the modeling is preferable. However, selecting a specific 
physical input, like EOS and neutrino opacities, as it was done by other 
workers in the field, provides only little insight into the physical conditions 
required for a successful explosion. Instead, Janka & Muller (1996b) took a 
published model of a stellar core shortly after core bounce (Bruenn 1993) and 
replaced the inner high-density region somewhat inside the neutrino-sphere 
by an inner boundary condition, which was allowed to contract in a controlled 
way and where the neutrino fluxes (and spectra) were set to chosen values. 
This allowed them to study the impact of the neutrino energy deposition and 
convection in the outer part of the collapsed core on the mechanism of the 
explosion. 

Janka & Muller (1996b) obtained powerful explosions and cases of failure 
or very weak explosions in both one-dimensional and two-dimensional simula- 
tions. The crucial parameter turned out to be the neutrino luminosities of Ue 
and i>e after bounce and their decrease with time. Strong explosions require 
sufficiently high neutrino fluxes for a sufficiently long period. The turbulent 
overturn of cold post-shock material and high-entropy, neutrino-heated gas 
provides an efficient mechanism to transport energy from the neutrino-heated 
region to the shock front and clearly aids the re-expansion of the stalled shock 
front. However, turbulence in the neutrino-heated layer is a crucial help for 
the explosion only in a narrow range of neutrino luminosities. For lower neu- 
trino fluxes the neutrino-heating is too weak and powerful explosions cannot 
be obtained even with convection. For higher neutrino fluxes convection re- 
duces the neutrino luminosity that is necessary to end up with a certain 
explosion energy (for more details see Janka & Muller 1996b). 

The nonspherical stratification and mass flow resulting from the convec- 
tive instabilities in the neutrino heated region are, like the ones occurring 
inside the proto-neutron star, a source of gravitational radiation. Because of 
the longer time scales (up to a few 100 ms) and the larger radii (up to several 
1000 km) involved in the “hot bubble” convection, the typical frequencies of 
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the emitted gravitational radiation are expected to be significantly lower and 
the corresponding amplitudes to be smaller (see Sect. 3.8). 

The evolution of the convective instabilities in the neutrino heated re- 
gion and the resulting supernova explosion are illustrated in Figs. 3.14, 3.15 
and 3.16, which are taken from Janka & Muller (1996b). Behind the shock 
high-energy neutrinos from the hot interior transfer energy to their reaction 
targets and heat the stellar matter. Due to the heating an unstable entropy 
gradient builds up between the shock and the neutrinosphere. Depending on 
the imposed neutrino flux at the inner boundary, the neutrino heated region 
becomes convective after about 50 -SO ms. This stage of the evolution is il- 
lustrated in Fig. 3.14, which shows two convective regions: (i) the lepton and 
entropy unstable layer inside the proto-neutron star just below the neutrino- 
sphere, and (ii) the entropy- unstable neutrino heated region. Note that the 
black, central, spherical region visible in Fig. 3.14 corresponds to the inner 
O.85M0 of the proto-neutron star not treated in the simulation. 

As a consequence of the transport of energy from the region of strongest 
neutrino-heating into the layers just behind the shock front, the pressure 
behind the shock rises, the shock speeds up, and the explosion of the star 
gains momentum. Fig. 3.15 shows the entropy distribution inside a radius of 
1590 km at a time 170 ms after the start of the simulation. The supernova 
shock can be recognized as the deformed (outer) discontinuity. Post-shock 
material penetrates inward, while neutrino-heated, high-entropy gas rises in 
expanding bubbles. The further evolution (in a different simulation) is illus- 
trated in Fig. 3.16, which shows the growth of the dynamically expanding 
region between the central neutron star and the supernova shock at 0.25 s, 
0.45 s, and 0.65s after shock formation (from top to bottom). The supernova 
shock is visible as the deformed discontinuity. Its (average) radial position 
grows from about 3 x 10^ cm in the top figure to 6 x 10^ cm in the middle 
and more than 9 x 10^ cm in the lower figure. Continued heating by neutri- 
nos increases the energy of the explosion during a period of several hundred 
milliseconds from 5 x 10'^^ erg at f = 0.1 s to 1.1 x 10^^ erg at f = 0.95 s. 

As the shock moves further out, the expansion of the post-shock region 
becomes faster, and the density between shock and neutron star drops rapidly. 
This is the moment {t ^ 0.5 s) when the explosion energy saturates and a very 
low-density, high-temperature bubble around the proto-neutron star appears 
(lower panel in Fig. 3.16). At the end of our calculation {t = 0.95 s) the shock 
wave has reached a radius of nearly 2 x 10^ cm. 

3.7 Rotational Core Collapse 

It is well known from observations that massive stars (spectral types 0- 
F) have surface equatorial rotational velocities on the main-sequence in the 
range lOOkm/s ^ usin(z) ^ 300km/s, where i is the inclination angle be- 
tween the axis of rotation and the light of sight (for a review, see e.g.,Tassoul 
1978). However, the initial angular momentum distribution and its evolution 
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Fig. 3.14. The snapshot shows down-flows of cool, lepton-rich gas from the post- 
shock region (red to orange) and rising bubbles of neutron enriched, neutrino-heated 
gas in the inner part of the collapsed stellar core out to a radius of 333 km about 
80 ms after the start of the simulation. The flow pattern is emphasized by velocity 
vectors. One can also recognize (blue region near the center) the convective activity 
below the neutrino-sphere. (Prom Janka & Muller 1996b) 
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Fig, 3,15. Entropy (in kB/nucleon) distribution in the central part of the star 
out to a radius of 1590 km at a time 170 ms after the start of the simulation. The 
supernova shock can be recognized as the deformed discontinuity between the dark 
blue and light blue regions. Post-shock material penetrates inward, while neutrino- 
heated, high-entropy gas rises in expanding bubbles. (From Janka & Muller 1996b) 
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Fig, 3.16. Entropy (in kB/nucleon) distribution in the central region of 12300 km 
radius at times t = 433 ms, 551ms, and 649 ms after the start of the simulation 
(from top to bottom). The supernova shock is visible as the deformed discontinuity 
and moves from about 6000 km to 10 000 km within 200 ms. The region behind the 
shock shows anisotropy on large scales with contrasts in the entropy of about a 
factor of 3. The darker area visible at late times in the center of the star indicates 
the formation of a nearly spherical, low-density, high-entropy hot-bubble region 
around the proto-neutron star. (Prom Janka &: Muller 1996b) 
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with time are unknown. The latter is determined by the amount of angular 
momentum transport occurring during the stars’ lifetime. If transport of an- 
gular momentum is inefficient, the increasing compactification of the central 
regions of the evolving massive star would lead to relatively rapidly rotating 
iron cores, even if the star was rotating slowly on the main-sequence (Endal 
& Sofia 1977). If, on the other side, magnetic fields or shear viscosity are able 
to reduce the angular momentum in the interior of the star sufficiently, its 
iron core will rotate slowly. Unfortunately, 2D evolutionary calculations of 
rotating massive stars up to the stage of core collapse, which could help to 
narrow down the range of initial (i.e., pre-collapse) angular momenta of iron 
cores, have not yet been performed (for first attempts in this direction, see 
e.g., Danger et ai. 1997). 

In spite of the unknown rotational state of the iron core prior to col- 
lapse, several attempts have been made in the past two decades to study 
the influence of rotation on the dynamics of Type II supernova explosion 
by means of hydrodynamic simulations (LeBlanc & Wilson 1970; Muller et 
aJ. 1980; Tohline et ai. 1980; Muller & Hillebrandt 1981; Symbalisty 1984; 
Monchmeyer & Miiller 1989; Finn & Evans 1990; Monchmeyer et ai. 1991; 
Marck & Bonazzola 1992; Monchmeyer 1993; Yamada & Sato 1994; Zwerger 
1995; Zwerger & Muller 1997). All of these simulations, except those recently 
performed by Zwerger & Muller (Zwerger 1995; Zwerger & Muller 1997), were 
restricted to a few specific core collapse models. They incorporated a more or 
less sophisticated treatment of the complicated microphysics and transport 
processes involved in the event (see Sects. 3.2 and 3.6). 

Zwerger & Muller followed up a somewhat complementary approach. They 
restricted themselves to a simplified description of the microphysics approx- 
imating the iron cores by rotating polytropes and neglecting any transport 
effects. Thereby they were able to perform a truly comprehensive parame- 
ter study of the space spanned by possible pre-collapse iron core models. A 
similar approach has been pursued by Yamada & Sato (1994). 

Concerning rotational core collapse calculations it is important to note 
two important facts. Firstly, all simulations up to now, which take into ac- 
count the effects of rotation, have only followed the evolution up to the point 
when the prompt shock wave turns into an accretion shock. The influence 
of rotation on the delayed explosion mechanism, and in particular the inter- 
action of rotation and convective instabilities, still has to be explored (for a 
first crude attempt see Shimizu et ai. 1994). Secondly, all these simulations 
have been performed using Newtonian hydrodynamics, i.e., relativistic effects 
have been neglected. 

3.7.1 General Considerations. The most important difference between 
the collapse of rotating and non-rotating cores is that in rotating cores cen- 
trifugal forces may stop the collapse before nuclear matter density is reached. 
This possibility was pointed out by Shapiro & Lightman (1976) and dis- 
cussed in more detail by Tohline (1984) and Eriguchi & Muller (1984). Nu- 
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merical examples of such a low-density bounce have been given by several 
authors (Muller et aJ. 1980; Symbalisty 1984; Monchmeyer & Muller 1989; 
Monchmeyer 1993; Yamada & Sato 1994; Zwerger 1995; Zwerger & Muller 
1997). 

A theoretical argument for the stabilizing influence of rotation on 
(pseudo) radial modes of stars was derived by Ledoux (1945). To guaran- 
tee the stability of a configuration in rotational equilibrium the adiabatic 
index (at fixed 5 and see Eq. (3.5) should fulfill the condition 



_ 2 (2 - 5/?) 

7 > Tcrit - 3 (1 _ 2/3) ’ 



the rotation parameter /? being defined by 






Erot 

l-^potl 



(3.22) 



(3.23) 



Here Erot and Epot are the rotational and potential energy, respectively. 
Although Eq. (3.22) was derived for slow rigid rotators with 7 indepen- 
dent of density, numerical calculations have shown that Eq. (3.22 holds un- 
der more general conditions, too (e.g.,Tassoul 1978). Moreover, Eq. (3.22) 
is only applicable if corrections due to general relativity are small (i.e.,if 
GM/Rc^ <C 1). Otherwise, the general relativistic analogue of Eq. (3.22) must 
be used (e.g.,Misner et al. 1973, Chap. 26). Compared to the Newtonian one 
the relativistic criterion gives a larger 7critj because general relativity tends 
to destabilize configurations. 

For a given average 7 the critical value of P a. rotating core in equilibrium 
must exceed to be stable against pseudo-radial modes is 



/? > yScrit = I ^ 
2 (5 - 37) 



(3.24) 



This is also a necessary condition for a core bounce to occur at sub-nuclear 
densities (Tohline 1984; Monchmeyer 1993). Actually, the stability and hy- 
drodynamic evolution of a collapsing iron core is determined by the effective 
adiabatic index F (see Eq. (3.7)). Hence, 7 must be substituted by F in 
Eq. (3.22) and (3.24). 

Due to conservation of angular momentum the centrifugal forces increase 
significantly during collapse. Matter in the equatorial plane does not fall to- 
wards the center as fast as matter at the polar axis leading to a progressive 
flattening of the core. In addition, the collapse time scale is lengthened com- 
pared to non-rotating models. 

Conservation of angular momentum may also lead to very rapidly ro- 
tating configurations, which are unstable against tri-axial deformations on 
secular or even dynamical time scales (Eriguchi & Muller 1984; Tohline 1984; 
Monchmeyer et al. 1991; Zwerger 1995; Zwerger Sz Muller 1997). For MacLau- 
rin spheroids, i.e.,for incompressible, rigidly rotating equilibrium configura- 
tions, secular and dynamical tri-axial instabilities grow, if P > 13.75% and 
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P > 27.38%, respectively (e.g., Tassoul 1978). Whether these instabilities do 
indeed occur (and if so, at which values of P) in compressible, differentially 
rotating stars is still an open question. If the equation of state at sub-nuclear 
densities is stiff, i.e.,if the adiabatic index is very close to 4/3, the core may 
be stabilized before its rotational energy exceeds the critical value. If, on the 
other side, the initial amount of rotation is small enough for the collapse to 
proceed to nuclear densities, /? > 0.14 may not be reached before bounce. 

After bounce a rotating core oscillates with a superposition of various 
radial and surface modes. The frequency of these modes is determined by the 
average density of the inner core. In contrast to a spherically symmetric core, 
which comes to rest soon after bounce, in a rotating core a certain fraction of 
its kinetic infall energy will be converted into oscillations, which are damped 
by nonspherical pressure waves. These oscillations strongly contribute to the 
gravitational wave signal of the bouncing core (see Sect. 3.8). Due to the 
asymmetric infall of matter the propagation of the shock wave at the pole 
differs from that near the equatorial plane. 

Finally, it is very well known that rotation can drive meridional circula- 
tions in baroclinic regions (Tassoul 1978). For core collapse supernova mod- 
els the interesting question arises, whether there exist large-scale circulation 
patterns behind the shock, that extend inside the neutrino-sphere and which 
could transport neutrinos towards the shock front much faster than diffusion 
(Muller & Hillebrandt 1981). 

3.7.2 Simulations of Rotational Core Collapse I: Models with De- 
tailed Microphysics. Up to now the most realistic iron core collapse mod- 
els have been calculated by Monchmeyer and co-workers (Monchmeyer & 
Muller 1989; Monchmeyer et al. 1991; Monchmeyer 1993). They considered 
the axi-symmetric collapse of a 1.36Mq iron core of a 20Mq star. The spher- 
ically symmetric initial model was put into a state of rotation by adding 
a certain (small) amount of rotational energy to it, i.e.,the initial model 
was not in rotational equilibrium. Four different angular momentum dis- 
tributions were examined. The initial rotation parameter was in the range 
0.005 < P < 0.02, and during collapse angular momentum conservation was 
assumed. A realistic equation of state (Hillebrandt & Wolff 1985) was used 
in the simulations, and electron captures on protons were taken into account. 
The neutrino transport was simplified by a leakage scheme. The main results 
obtained by Monchmeyer et aJ. (1991) are: 

— An initially fast rotating model {p\ ~ 0.02) bounces well below nuclear 
matter density (^b — 3.6 x lO^^gcm"^) and evolves into a quasi-static 
rotating equilibrium configuration with a final rotation parameter p{ = 
0.14. 

- If the initial rotation rate is moderate (/?i = 0.01) the centrifugally caused 

bounce occurs just below nuclear matter density (^b 10^"^gcm“^) and 

causes large-scale amplitude oscillations of the core, which strengthen the 
prompt shock wave. 
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- If the initial rotation rate is small (/3i ^ 0,005) the bounce occurs due to 
the stiffness of nuclear matter. In this case rotation “hurts” the prompt 
explosion mechanism. The collapse, which is fastest near the rotation axis, 
leads to the formation of an aspherical shock wave, which is strongest near 
the rotation axis. Hence, an entropy bubble forms at the polar axis, which 
is convectively unstable. The resulting convective flow weakens the prompt 
shock by transporting high entropy material from the polar region towards 
the equator. 

- Measuring the gravitational wave signal allows one to discriminate between 
a bounce caused by the stiffness of nuclear matter and a bounce caused by 
centrifugal forces (see Sect. 3.8). 

3.7.3 Simulations of Rotational Core Collapse II: Poly tropic Mod- 
els. The following discussion is restricted to the results of Zwerger & Muller 
(Zwerger 1995; Zwerger & Muller 1997), because their numerical study is 
the most comprehensive one (but see also Finn Sz Evans 1990; Yamada & 
Sato 1994). In their simulations the collapsing core was approximated by a 
rotating polytrope in order to examine a large set of initial conditions. They 
used a simplified analytic equation of state (Janka et ai. 1993) neglecting all 
transport effects. The equation of state consisted of a “polytropic” part Pp 
and a “thermal” part Pth, 



P — -^p "b -^th ‘ 



(3.25) 



The poly tropic part Pp accounts for pressure contributions due to degenerate, 
relativistic electrons and (at high densities) due to the repulsive action of 
nuclear forces. Pth mimics the thermal pressure in the matter heated up by 
the prompt shock. Writing P as a sum of these two terms prevents matter 
elements from following the same P- ^history during infall and expansion 
after bounce. Two different density regimes are considered, where 



f if ^ < ^nuc 

I K 2 * , otherwise. 



(3.26) 



The “nuclear density” ^nuc = 2 x 10^^ gem the supra-nuclear adiabatic 
index P 2 = 2.5, and the value of K\ is fixed to 



Ki = ^ (5)2/3 he (^)‘*/3 = 4.897 X lO^^cgs, 
4 3 me 



(3.27) 



which holds for an ideal gas of ultra-relativistic, degenerate electrons (Pi = 
4/3). From the first law of thermodynamics the corresponding thermodynam- 
ically consistent energy density follows (for adiabatic changes) as 






+ E'i' Q, 



if Q< Qnu 
otherwise , 



(3.28) 



where Ei and £2 are related to K\ and K 2 by 
Ki = (A - 1) ■ £1 and K 2 = (P 2 - 1) • E 2 . 



(3.29) 
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The constants E 2 (and hence K 2 ) and are fixed by requiring continuity 
of Pp and Up at ^nuc* This leads to 

= and E3 = • Ei • (3.30) 

The thermal pressure Pth is related to the thermal energy density Uth by 
an ideal gas relation 

Pth = (Pth-l)-wth, (3.31) 

with Pth = 1*5 corresponding to a mixture of relativistic and non-relativistic 
gases. The total energy density 

u — up H“ Uth* (3.32) 

Initially Uth and Pth are zero and should remain so in the absence of shocks. 
Further details are given in Janka et aL (1993). 

In their study Zwerger & Muller 1997 computed the evolution of models 
with Pi = 1.325, 1.3225, 1.32, 1.315, 1.31, 1.30, 1.29 and 1.28, respectively. 
The pre-collapse models had rotation parameters /3\ = 0.0025, 0.0045, 0.009, 
0.018 and 0.04, respectively. Initially all models rotated according to the 
rotation law (Eriguchi & Muller 1984) 



f2{w) = 



j?o 

1 -h (vj/A)^ 



(3.33) 



where w is the distance from the rotation axis. The constant J?o is uniquely 
related to the parameter , while A is a free parameter determining the initial 
angular momentum distribution. For large values of A (compared to the size 
of the initial model) one obtains almost rigidly rotating configurations, while 
small values of A correspond to strongly differentially rotating ones. Zwerger 
& Muller examined models with yl = 5 x 10^ cm, 10® cm, 5 x 10^ cm and 
10^ cm, respectively. 

Contrary to all previous studies, except those of Bonazzola & Marck 
(1993, 1994), the initial models were in rotational equilibrium being calcu- 
lated with the method of Eriguchi & Muller (1984). The pre-collapse models 
were P = 4/3 poly tropes, whose collapse was induced by suddenly reducing 
P to one of the above specified values Pi. In the parameter space spanned 
by Pi, /?i and A, Zwerger & Muller computed a total of 81 initial models and 
followed their evolution well beyond core bounce. 

A subset of models suffers a bounce caused by centrifugal forces at sub- 
nuclear densities. For a given the bounce density decreases with increasing 
Pi and decreasing A, i.e., strongly differentially rotating models with Pi close 
to 4/3 have the lowest bounce densities ^b- Models with A = 10^ cm and 
(3\ > 0.009 bounce at 6.2 x 10^^ gem”® < Qb ^ 7.0 x 10^®gcm“® the model 
with the lowest having = 0.04 and Pi = 1.30. But even for quite rigidly 
rotating configurations with A = 10® cm the collapse is stopped in some cases 




3. Simulations of Core Collapse Supernovae 391 




40 60 80 100 120 140 



time [msec] 

Fig. 3.17. Centred density as a function of time for two models, which experience 
a bounce due to centrifugal forces at sub-nuclear densities (from Zwerger & Muller 
1997) 



well below nuclear matter density, e.g.^g^y = 3.6 x lO^^gcm”^ for = 0.018 
and Fi = 1.32, and = 7.0 x lO^^gcm"^ for /3j = 0,009 and Ji = 1.325. 

Models suffering a bounce due to (or mainly due to) centrifugal forces 
show large amplitude oscillations of the inner core. The central density varies 
by more than a factor of ten in such models (Fig. 3.17), For example, in 
the model with A ^ 5 x 10^ cm, = 0.018 and Fi = 1.31 (solid curve 
in Fig. 3.17) the bounce density is = 4.6 x lO^^gcm”^. Then the model 
re-expands, the central density dropping down to ^ = 7.5 x lO^^gcm'^. 
Eventually the model achieves a new equilibrium state with a central density 
of ^ = 1.3 X 10^^ gcm”^. Even more extreme is the model with A = 10® cm, 
P\ = 0.009 and A = 1.325 (dotted curve in Fig. 3.17), where the central 
density drops by about a factor of 50 from 6.9 x 10^® gem"® (at bounce) to 
1.4 X 10^^ g cm"® (at maximum re-expansion) and then rises again by a factor 
of about 30 to 4.4 x 10^® gem"® (during the second bounce Fig. 3.17). 

Because of angular momentum conservation the effects of rotation be- 
come more important during collapse, which is reflected by an increase of the 
value of (3. For several models 0 exceeds the critical value (3 = 0.1375, where 
MacLaurin spheroids become secularly unstable against tri-axial perturba- 
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Fig. 3.18. Time evolution of the rotation pareuneter (3 for four models: A = 10^ cm, 
= 0.04 and A = 1.28 (solid), A — 10^ cm, = 0.04 and A = 1.30 (dash- 
dotted), A = 5 X 10^ cm, = 0.04 and A = 128 (dotted), A = 5 x 10^ cm, 
= 0.04 and A = 1-30 (dashed). The dotted horizontal lines mairk the critical 
values /?sec (lower line) and /?dyn (upper line), where MacLaurin spheroids become 
secularly and dynamically unstable against non-axisymmetric perturbations. (From 
Zwerger &: Muller 1997) 



tions. Most of the secularly unstable models with A < 10® cm, (3\ > 0.009 
and A < 1-32 remain in the unstable regime for several milliseconds, and 
for (3i = 0.04 even for several tens of milliseconds (Fig. 3.18). Whether this 
length of time is long enough to allow for a significant growth of the secular 
instability is, however, unclear. Two of the most differentially (A = 10^ cm) 
and rapidly {(3\ = 0.04) rotating models reach (A = 1.30) and even exceed 
(A — 1-28) the critical value for dynamical stability {(3 ^ 0.2738; Fig. 3.18). 
Although such large values of p only prevail for about one millisecond, this 
length of time is sufficiently long for a significant growth of the dynamical 
instability (see Sect. 3.8.6). 

3.8 Gravitational Wave Signature of Core Collapse Supernovae 

Until the year 2001 four long-baseline laser interferometric gravitational wave 
detectors will become operational (GEO600, LIGO, TAMA, VIRGO). These 
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new instruments as well as several presently being built cryogenic resonant- 
mass detectors (including improved conventional bars cooled to temperatures 
of ^ 50 mK and more recent large mass spherical detectors, like e.g.,TIGA 
(Merkowitz & Johnson 1995; see also Harry et ai. 1996)) will achieve a (broad- 
band and/or narrow-band) sensitivity allowing astronomers for the first time 
in history to directly detect gravitational waves and to enter the area of gravi- 
tational wave astrophysics (for a review see, e.g., Abramovici et ai. 1995). The- 
oretical predictions of the gravitational wave signature (wave forms, spectra, 
duration) of potential astrophysical sources are thus urgently needed, e.g,, to 
improve the low signal-to-noise ratio. Among the potential sources core col- 
lapse supernovae (i.e.,SNeII and SNelb/Ic) involving the formation of a 
neutron star or a black hole are thought to emit a significant amount of grav- 
itational radiation, if the collapse and/or the early phase of the explosion 
is very nonspherical (for a review see Thorne 1987; Piran 1990; Bonazzola 
& Marck 1994; Thorne 1995; Muller 1997). Non-sphericities can be caused 
by the effects of both rotation and convection leading to an overall coher- 
ent deviation from spherical symmetry or small-scale statistical fluctuations, 
respectively. 

Measuring the gravitational wave signal of core collapse supernovae pro- 
vides unique and direct information about the collapse dynamics. Note in 
this respect, that the electromagnetic “display” of a core collapse supernova 
only starts when the shock wave reaches the surface of the progenitor star 
hours after core bounce. Besides gravitational waves, the only other means to 
directly probe the collapse dynamics are neutrino observations, A measure- 
ment of both the neutrino and the gravitational wave signal would be most 
exciting, because it would provide extremely important and complementary 
information for the theoretical modeling of core collapse supernovae. 



3.8.1 Quadrupole Formula. The gravitational wave signal is most com- 
monly calculated in the post-Newtonian quadrupole approximation (u c 
and ^ c^), in which the gravitational wave amplitude is proportional to the 

second time derivative of the mass-energy quadrupole moment. The numer- 
ically troublesome second-order time derivative of the quadrupole moment 
can be transformed into much better tractable spatial derivatives. In par- 
ticular, the gravitational quadrupole radiation field, can be calculated 
with an expression derived independently by Nakamura & Oohara (1989) and 
by Blanchet et aJ. (1990): 



2G 



X J d^x Q 



Pijki{N) X 

2v^v^ - di^ - x^ dk^ 



(3,34) 



where R = l-X”! is the distance between the observer and the source, ^ is 
the Newtonian gravitational potential, q is the mass-density and v is the 
velocity. The other quantities have their usual meanings except for Pijki {N) 
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(with N = X/R) which denotes the transverse-traceless (TT) projection 
operator onto the plane orthogonal to the outgoing wave direction iV, acting 
on symmetric Cartesian tensors according to 

PijkiW = - ^i^k) ’ N^Ni) + 

- \{Sij - NiNj) (Ski - NkNi) . (3.35) 



di represents the partial derivative with respect to the x* coordinate. The 
integrand in Eq. (3.34) is defined on a compact manifold and is known to 
the (2nd order) accuracy level of the numerical algorithm of the hydro-code. 
Eq. (3.34) can be shown to be equivalent to the standard representation 

where the trace-free part of the mass-quadrupole tensor of the matter distri- 
bution is given by 

Qij(t) = J Q(x,t) 



r 2 

00 i,00 j ~ 0 ‘ij OC 



It can be easily shown that evaluating the integral of Eq. (3.34) by an inte- 
gration scheme (of at least 2nd order) is by one order of accuracy superior 
to twice applying numerical time-differentiation methods to quadrupole data 
given at discrete points of time (Finn & Evans 1990; Monchmeyer et ai. 1991). 

3.8.2 Evaluation of the Quadrupole Formula I: Axisymmetric 
Source. The gravitational radiation field gives direct information about the 
second time derivative of the mass-quadrupole tensor Eq. (3.37). In case of 
axisymmetry the quadrupole moment, Q, is the only independent component 
of the quadrupole tensor. Its relation to the Cartesian component Qzz of the 
radiative mass-quadrupole tensor is 




For 2D (axisymmetric) simulations, which are most conveniently per- 
formed in spherical coordinates, it is natural to represent the (total) radiation 
field in terms of the “pure-spin tensor harmonics” and 

with amplitudes and in the following way (Thorne 1980): 



1 ^ f p 

h^^(x,t) = I E E { - f ) <p) + 

2 m=~l ^ 

+ (3.39) 

In spherical coordinates the coefficients A^ and A^ have especially simple 
integral representations over the source. By symmetry, there is only one non- 
vanishing quadrupole term in Eq. (3.39), namely A^o^ Higher-order terms are 
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neglected in the quadrupole approximation of the gravitational radiation 
field Transforming Eq. (3.34) to spherical coordinates and expressing vi 
in terms of unit vectors in the r, 6 and 0 direction, one obtains by comparison 
of Eq. (3,34) with the lowest-order term of Eq. (3,39) for the quadrupole wave 
amplitude the expression 
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^20 
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(3.40) 



where dr — djdr, 3 q — djdQ, and z = cos 0 . 

Prom the definition of (Thorne 1980, Eq. (2.39e)) one derives for 

the components of the formula 



h 



TT _ 
Qd - 





(3.41) 



The only other nonzero component is hJJ — —hj^. Note that due to the 
hypotheses of (i) axisymmetry and (ii) of post-Newtonian gravity hj^ = h_|_ 
and hx = 0 , where and hx are the wave amplitudes corresponding to the 
two independent polarizations of the gravitational radiation field (see next 
subsection) . 

The total energy radiated in gravitational waves is given by the general 
expression 



G 327T 




(3.42) 



The total energy radiated per unit frequency dE/du can be calculated from 
the amplitudes A 20 using the fast Fourier transform technique. 

3.8.3 Evaluation of the Quadrupole Formula II: Genuine Three- 
Dimensional Source. If the source is of genuine three-dimensional nature, 
it is common to express the gravitational quadrupole radiation field, in 
the following tensorial form (Misner et ai. 1973, Chap. 35): 

h^(X,t) = i (A+e+ + Axex) (3.43) 

with the unit linear-polarization tensors 

e_|_ = €0 e& - (Sf , (3.44) 

6 x = €0 0 60 + 60 ( 8 ) 60 , (3.45) 

60 and 60 being the unit polarization vectors in 0 and (/^-direction of a spheri- 
cal coordinate system and 0 the tensor product. The amplitudes A^ and Ax 
(or the dimensionless amplitudes = A^/R and hx = Ax/R) represent 
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the only two independent modes of polarization in the TT gauge, and are 
given by the following expressions for ^ = 0, — 0: 



— ixx iyy ? (3.46) 

= 24y , (3.47) 

and for 0 = 7 t/ 2, = 0 by 

A+ = h, - iyy , (3.48) 

- - 24 , , (3.49) 

where 

hj ~ ^ J Q ( 2 t'iUj - Xidj^ - Xjdi^) . (3.50) 



The total energy radiated in form of gravitational waves is given by 
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AyyAzz + 3 {A^y -h A^^ + (3-^2) 



with 

Aij = iij . (3.53) 

3.8.4 Gravitational Radiation from Convective Instabilities. The 
gravitational wave signature of convective instabilities inside the proto- 
neutron star and in the neutrino heated post-shock region has been investi- 
gated in detail by Muller & Janka (1997). They calculated the quadrupole 
waveforms, the energy spectra, and the total amount of the emitted gravita- 
tional wave energy due to the anisotropic mass distribution and mass motion. 
Moreover, they estimated the amplitude and energy of gravitational waves 
associated with the anisotropic neutrino emission that is caused by the con- 
vective transport of neutrinos and by aspherical perturbations of tempera- 
ture and density in the neutrinospheric region. The investigation included 
the analysis of both a 2D and a 3D simulation of convection inside the proto- 
neutron star (Muller & Janka 1997), as well as three 2D simulations of the 
convective instability in the hot-bubble region covering the evolution for up 
to 1 sec past core bounce (Janka & Miiller 1996b). 

The only other related investigation up to now has been performed by 
Burrows & Hayes (1996). These authors conducted a single exploratory and 
experimental calculation of aspherical collapse, imposing a large amplitude 
(15%) density perturbation within 20° of the pole (i.e.,the symmetry axis) 
of their pre- collapse iron core model. Simulating the core’s evolution in 2D 
until about 50 ms past core bounce, they determined the gravitational wave 
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signature due to anisotropic neutrino radiation and mass motion. The 2D 
results of Muller & Janka (1997) are in rough qualitative agreement with 
the findings of Burrows & Hayes (1996), although there are quantitative 
differences. 

Muller & Janka (1997) found that for a supernova located at a distance 
of lOkpc the maximum dimensionless gravitational wave amplitudes 
associated with convective mass motions range from about 9 x 10“^^ to 
4 X 10“^^ . The total amount of the emitted energy varies from 3 x 1O”^^M0C^ 
to 5 X 10" Convective motions inside the proto-neutron star involve 

more mass and are more violent and therefore produce a stronger gravita- 
tional wave signal with up to a factor of 10 larger wave amplitudes. Most of 
the gravitational radiation from convection inside the proto-neutron star is 
emitted in the firequency band 100-1000 Hz, while convective motions in the 
hot-bubble region generate waves from several 100 Hz down to a few Hz. 

Comparing different 2D models Muller & Janka (1997) find that with 
increasing total neutrino luminosity and hence with increasing explosion en- 
ergy the convective activity in the hot-bubble region changes from violent 
convective overturn associated with anisotropic accretion processes to rapid 
overall expansion and relatively slowly changing large-scale deformations of 
the expanding shells behind the outward propagating supernova shock. This 
change of the characteristics of non-radial motions in the hot-bubble region 
is directly refiected in the dominant frequencies of the gravitational wave sig- 
nal. While turbulent overturn around the proto-neutron star produces grav- 
itational waves with most power at frequencies of 100-200 Hz, the dominant 
frequencies are at only some 10 Hz when the period of convective activity 
is short and the non-sphericity of the model is determined by the explosive 
expansion. Thus, a measurement of the frequency of the wave signal would 
provide important insights into the explosion dynamics. Moreover, since the 
signal produced by the convection inside the proto-neutron star is typically 
of much higher frequency (about 1000 Hz), it would also be possible to dis- 
criminate the contributions from the two convection zones to the measured 
signal. 

Interestingly, structures in the gravitational wave signal are well corre- 
lated with prominent features in the neutrino emission, if both gravitational 
wave and neutrino production are associated with dynamical processes in 
and around the nascent neutron star. Simultaneous information from both 
neutrino and gravitational wave measurements would therefore impose im- 
portant constraints on core collapse models. 

Anisotropic neutrino emission generates gravitational waves, too (Epstein 
1978; Burrows & Hayes 1996). Muller & Janka (1997) estimated the degree 
of anisotropy from the density and temperature inhomogeneities associated 
with convective and turbulent processes in the neutrino-spheric region. They 
find that for typical post-bounce neutrino luminosities the gravitational wave 
amplitude can be larger than the wave amplitude due to mass motions by 
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a factor of about 5, although the energy in the neutrino tidal field is only a 
minor contribution (few percent) to the total energy radiated in gravitational 
waves. 

As emphasized by Muller & Janka (1997) the results for the gravitational 
wave signal associated with nonspherical neutrino emission depend sensitively 
on the duration of the phase of anisotropic neutrino loss and on the temporal 
evolution of the total luminosity in all kinds of neutrinos. These character- 
istics will vary with the properties of the exploding star and thus with the 
parameters of the forming neutron star and will also be sensitive to the details 
of the numerical scheme and physical input used for the simulations. 

Muller & Janka (1997) further found that the 3D simulation of convection 
inside the proto-neutron star gives a strongly reduced gravitational wave sig- 
nal compared to the corresponding 2D model. The main reason for this is that 
in three spatial dimensions the convective structures and elements are smaller 
(Z3D 0.5/2 d)j move less fast (usd ^ 5 x 10® cm/s 0.5 v 2 d)» and, corre- 
spondingly, show less strong overshooting and undershooting (0.8 instead 
of 1.2 pressure-scale heights). The maximum quadrupole amplitudes due to 
mass motions are reduced by about a factor of 30, the gravitational wave 
amplitudes associated with the anisotropic neutrino emission by as much as 
a factor of 10. The total amount of energy radiated in form of gravitational 
waves is 2-3 orders of magnitude smaller in 3D. A similarly strong reduction 
of the signal strength is to be expected for the gravitational waves emitted by 
turbulent motions in the hot-bubble region when corresponding simulations 
will be performed in three spatial dimensions. 

All neutrino driven supernova models that have been analyzed for their 
gravitational wave emission up to now do not include any rotational effects. 
Therefore, future supernova modeling will have to determine the strength 
and the form of the gravitational wave signal caused by the combined effects 
of rotation, convection inside the proto-neutron star, and neutrino-driven 
overturn in the hot-bubble region. 

3.8.5 Gravitational Radiation from Axisymmetric Rotational Core 
Collapse. The gravitational wave signature of axisymmetric rotational core 
collapse has been computed by various authors (Muller 1982; Finn & Evans 
1990; Monchmeyer et al. 1991; Yamada & Sato 1995; Zwerger 1995; Zwerger 
& Muller 1997). In the following I mainly discuss the results of Zwerger <fe 
Muller (Zwerger 1995; Zwerger & Muller 1997), because they are the most 
comprehensive ones and because they more or less comprise the other results, 
too. However, two points from the other investigations are worth mentioning 
here. 

Firstly, Monchmeyer et al. (1991) found that there exist two structurally 
different types of gravitational wave signals emitted during rotational core 
collapse. “Standard” signals of type I exhibit a large wave amplitude at 
bounce, which is followed by a “ring-down” of the signal, i.e.,by oscillations 
with decreasing damped amplitudes. This “ring-down” is due to damped 
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ff Pb 

[g/cm"] [M.C*] 



1.325 1.73e+14 8.78e-09 
1.320 2.79e+14 5.31 e-08 
1.310 3.36e+14 2.79e-08 
1.300 3.72e+14 4.31 e-09 
1.280 4.23e+14 4.28e*10 



Fig. 3.19. Quadrupole amplitudes produced by the collapse of almost rigidly 
rotating poly tropes with p\ = 0.009. In each panel the vertical dotted line marks 
the time of bounce. The adiabatic index A , the central density at bounce and the 
total amount of energy radiated in form of gravitational waves are given for each 
model (by the line from top to bottom) in the table in the lower right paxt of the 
figure. (From Zwerger & Muller 1997) 



volume and surface oscillations of the inner core after bounce. Signals of 
type I have been found in all other investigations, too (see top right panel 
of Fig. 3.19). They axe produced by cores bouncing at nuclear density or at 
low central densities when the ratio of (radial) kinetic energy to rotational 
energy is small. Signals of type II are characterized by several pronounced 
peahs, which arise if the core bounces more than once (see top left panel 
of Fig. 3.19). Between the peaks the signal varies smoothly. Monchmeyer et 
aJ. (1991) could not clarify which conditions are sufficient for the appearance 
of a single dominant coherent volume mode causing the type II signal. Hence, 
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r. p, Ec 

[g/cm’] [M«c^] 



1.325 1,73e+14 8.78e-09 
1.320 2.79e+14 5.31 e-08 
1.310 3.36e+14 2.79e-08 
1.300 3.72e+14 4.31 e-09 
1.280 4.23e+14 4.28e-10 



Fig. 3.20. Same as Fig. 3.19, but showing the spectral energy density dE/du. (From 
Zwerger & Muller 1997) 



they wondered whether such signals are only produced by models in a very 
narrow parameter range (see below). 

Secondly, performing a set of nine collapse calculations of rotating poly- 
tropes, Yamada & Sato (1995) found that the peak amplitude of burst-like 
type I gravitational wave signals saturates for q ^ 0.5, where q = is 

the dimensionless angular momentum of the core. They further found that 
the peak amplitude is sensitive to the stiffness of the equation of state just 
below nuclear matter density. 

Zwerger & Muller (Zwerger 1995; Zwerger & Muller 1997) found both 
type I and type II gravitational wave signals. Which type occurs is solely 
determined by the adiabatic index, and in particular does not dependent on 
whether the central density of the core reaches nuclear matter density or not. 
Signals of type I are produced by models with a “soft” equation of state (Ti ^ 
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frequency [Hz] 



Fig. 3.21. The dimen- 
sionless 

gravitational wave am- 
plitude of the models of 
Zwerger (1995). The er- 
ror bar gives the fre- 
quency range, which con- 
tains most of the spec- 
tral power. It is assumed 
that the source is lo- 
cated at a distance of 
lOMpc. The amplitudes 
of the four models cal- 
culated by Monchmeyer 
et aJ. (1991) are also 
shown and are marked by 
crosses. (From Zwerger & 
Muller 1997) 



1.31), while signals of type II require a ‘‘stiff” equation of state {Fi ^ 1.32). 
The distinct amplitude spikes of the type II signals nicely correlate with the 
maxima of the central density. Moreover, decreasing the adiabatic index from 
1.325 to 1.28 and keeping both other model parameters (i.e., /3j and the initial 
rotation law) fixed, Zwerger & Muller observe a smooth transformation of the 
signal type. This is illustrated for a sequence of models with /3\ ^ 0.009 and 
A = 5 X 10® cm in Fig. 3.19. For Fi = 1.325 the signal is of type II. It is of an 
intermediate type (i.e., between typell and typel) for F\ — 1.32, and turns 
into a “standard” typel signal for A = 1.31 and 1.30. Finally, for A 1.28 
a third type of signal is observed. Instead of showing a pronounced spike at 
bounce with a large negative wave amplitude, this so-called type III signal 
has a large positive and a somewhat smaller negative wave amplitude just 
before and after bounce, respectively. Signals of type III are not found for 
extremely differentially rotating initial models {A = 10^ cm). 

Zwerger & Muller also computed the amplitude and energy spectra of the 
gravitational radiation emitted by their models. The energy spectra cover a 
frequency range of 50Hz ^ z/ ^ 3 kHz, but most of the power is emitted 
between 500 Hz and IkHz. Models bouncing at sub-nuclear densities have 
spectra, which drop extremely rapidly above 1 kHz, and models with a typell 
wave signal have spectra, which show characteristic oscillations (Fig. 3.20). 
These oscillations vanish when the signal type changes to typel. Moreover, 
a detailed analysis reveals that the oscillatory spectral component appears 
immediately after the second bounce. Additional bounces produce no new 
spectral features. They only increase the overall spectral power. The results 
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further show that the spectra are neither very sensitive to nor to the degree 
of differential rotation. 

The total amount of energy radiated in form of gravitational waves lies 
in the range 5 x 10“^^ M©c^ ^ fi'cw ^ 8 x 10~® M©c^. The corresponding 
dimensionless wave amplitudes are in the range 4 x ^ ^ ^ 4 x 10“^^ 

for a source at a distance of lOMpc (Fig. 3.21). The largest signals are either 
produced by models which are initially slowly (/?•, < 0.009) rotating and have 
an adiabatic index Fy > 1.32, or which are initially rapidly [ji\ > 0.018) 
and strongly differentially [A — 10^ cm) rotating and have a relatively small 
adiabatic index {Fy < 1.30). The first class of models experiences a bounce 
at densities above nuclear matter density with a fast deceleration of the col- 
lapsing core. This gives rise to a rapid change of the quadrupole moment of 
the core and consequently to a strong gravitational wave signal. In the other 
models the quadrupole moment is larger due to fast rotation, but changes 
less rapidly. 

3.8.6 Gravitational Radiation from Non-axisymmetric Rotational 
Core Collapse. Concerning the collapse of initially rapidly rotating cores 
many authors have pointed out the possibility that if the core becomes sec- 
ularly {/3 > Psec ^ 0.14) or dynamically {P > Pdyn ~ 0.27) unstable to 
non-axisymmetric perturbations, a much stronger gravitational wave signal 
might be produced than by cores which remain axisymmetric during collapse 
(for a review, see e.g.,Piran 1990; Thorne 1995). According to this idea, the 
core will experience a centrifugal hang-up before nuclear matter density is 
reached and will be transformed into a bar-like configuration that spins end- 
over-end like an American football, if the rotation rate of the core exceeds 
the critical rotation rate(s). One has further speculated, whether the core 
might even break up into two or more massive pieces, if /3 > Pdyn- In that 
case the resulting gravitational radiation could be almost as strong as that 
from coalescing neutron star binaries (Thorne 1995). The strength of the 
gravitational signal sensitively depends on (i) the radius at which the cen- 
trifugal hang-up occurs and (ii) what fraction of the angular momentum of 
the non-axisymmetric core goes into gravitational waves, and what fraction 
into hydrodynamic waves. These sound and shock waves are produced as 
the bar or lumps, acting like a twirling-stick, plow through the surrounding 
matter. 

Nearly all investigations up to now did not consider the dynamics of 
non-axisymmetric collapse, but only studied the growth of secular and dy- 
namical instabilities in already collapsed cores (i.e.,in nascent neutron stars) 
modeled as rapidly rotating, non-axisymmetric equilibrium poly tropes or el- 
lipsoids (Yoshida Sz Eriguchi 1995; Imamura et aL 1995; Lai & Shapiro 1995; 
Houser et aJ. 1994; Smith et a/. 1996; see also Pickett et al. 1996). Note that, 
as the literature on this subject is extensive, I have only referenced the most 
recent studies here. Such an approach can be used to predict the gravita- 
tional radiation from rapidly rotating neutron stars, which might form as a 
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Fig. 3.22. Ray-cast images of the development of a non-axisymmetric instability 
in a collapsing, initially axisymmetric, rapidly rotating core. About 2 ms before 
the core becomes dynamically unstable against non-axisymmetric deformations the 
model has been perturbed by imposing a 5% density variation of the form sin(m0) 
with m = 3. The intensity is a measure of the density (brighter regions have larger 
densities) and the colors encode the thermal energy increasing from red, over yellow 
and green to blue. The size of the blue frame is 160 x 160 x 80 km 
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consequence of core collapse. However, it is not appropriate for predicting the 
gravitational wave signature of the collapse of rapidly rotating cores. I stress 
this difference here, because it is often overlooked: Numerical simulations of 
non-axisymrnetric instabilities in equilibrium post-collapse models are inad- 
equate to obtain information about the evolution of the density and angular 
momentum distribution of collapsing axisymmetric cores, which eventually 
become non-axisymmetric. 

Up to now three-dimensional hydrodynamic simulations of rotational core 
collapse, which can follow the nonlinear growth of non-axisymmetric instabil- 
ities during collapse, have only been performed by Bonazzola & March (1993, 
1994), and by March & Bonazzola (1992). For their simulations they used a 
pseudo-spectral hydrodynamic code and a polytropic equation of state. They 
computed the evolution of several initial models and found that the gravita- 
tional wave amplitude is within a factor of two of that of 2D simulations for 
the same initial deformation of the core (Bonazzola & March 1994). However, 
their simulations were restricted to the pre-bounce phase of the collapse, and 
thus are less relevant. 

Recently, Rampp et ai. (1997) have also explored non-axisymmetric in- 
stabilities of collapsing, rapidly rotating cores and computed the resulting 
gravitational wave signal. Their starting point were the axisymmetric ro- 
tational core collapse models of Zwerger (Zwerger 1995; Zwerger & Muller 
1997). They selected those models, which during collapse eventually reach 
rotation rates making them unstable against non-axisymmetric instabilities 
on secular or dynamical time scales. The time evolution of the rotation pa- 
rameter /3 of the models, which are most promising in this respect, is shown 
in Fig. 3.18. In particular, they have investigated the model with A = 10^ cm, 
= 0.04 and A = 1.28 (solid curve in Fig. 3.18), which reaches a maximum 
value of /?max = 0.348 during collapse and is above the dynamical stability 
limit {p > ^(iy„ — 0.27) for 1.2 ms. At t = 28.2 ms they have mapped the 
axisymmetric model computed on a 2D spherical grid (360 radial and 90 an- 
gular zones) onto a 3D multiply-nested Cartesian grid (5 nested grids with 
64^ X 32 equidistant zones each). This moment of time is about 2.5ms be- 
fore the maximum rotation rate is reached and about 2 ms before the model 
becomes dynamically unstable against non-<ixisymmetric deformations. The 
axisymmetric model was perturbed by imposing a 5% density perturbation 
depending solely on the azimuthal angle 0 about the rotation axis of the form 
Sg/g oc sin(m0) with m = 3 in the whole computational volume. Note that 
an additional small amj)litude perturbation (^ 1%) is caused by the mapping 
procedure itself. 

The development of the instability is shown in Fig. 3.22. Due to its rapid 
and strongly differential rotation the collapsing core develops a ring-like den- 
sity distribution, which is clearly visible in the first snapshot. A few millisec- 
onds after the model has been perturbed an m = 3 mode shows up in the 
density distribution. Note that although the m = 3 perturbation has been 
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imposed on the whole computational grid, it grows only in the neighborhood 
of the ring-like density maximum. The further evolution is characterized by 
the development of three density maxima (“lumps”) forming out of matter 
of the ring-like structure. The density contrast between the “lumps” and 
other matter at the same radius exceeds a factor of ten. Subsequently the 
three “lumps” , which reflect the initially imposed m = 3 mode of perturba- 
tion, merge within about 5 ms into a bar-like (i.e.,m = 2) density structure, 
which according to linear stability analysis is the most unstable dynami- 
cal mode (e.g.,Tassoul 1978). The overall evolution is further characterized 
by intensive hydrodynamic activity produced by the twirling-stick action of 
first the “lumps” and later of the “bar”. Most notably are trailing “spiral 
arms” causing mass and angular momentum to be transported away from 
the non-axisymmetric, high density regions (for further details see Rampp et 
aL 1997). 

The gravitational wave signal obtained in the 3D simulation is comparable 
to that found in the corresponding axisymmetric model. Further simulations 
with different initial models and different types of perturbations as well as 
simulations covering later epochs of the core’s evolution towards a rapidly(?) 
rotating, tri-axial(?) neutron star are definitely necessary. Studying the latter 
problem will, however, require other (implicit, relativistic) hydro-codes and 
a better treatment of the microphysics, particularly, of the equation of state 
and the neutronization of core matter. 



4. Hydrodynamics and Thermonuclear Burning 

The discussion in this section is an updated and modified version of that 
originally published in preprint form in an extensive review by Fryxell, Muller 
& Arnett (1989). The review was to be published as a contribution to a book 
on “Numerical Methods in Astrophysics” in 1989, but unfortunately the book 
has not appeared. Part of the review can also be found in Muller (1994) and 
is incorporated into chapters 4 and 11 of Arnett (1996). 

Much of our understanding of astronomy and astrophysics is insepara- 
bly connected to our understanding of hydrodynamics and nuclear burning. 
Thermonuclear processes and gravitational binding are the two major sources 
of energy for astronomical phenomena. Where in astrophysics do we find nu- 
clear burning? One of the most obvious examples is the Sun, representing the 
great majority of observed stars which calmly burn their nuclear fuel as ob- 
jects in a state of quasi- hydrostatic equilibrium. Even the most placid of such 
stars can have its evolution changed by the action of slow mixing currents, 
acting over the long lifetime of the object. More spectacular are those objects 
exhibiting explosive phenomena and dynamic stages of evolution, such as no- 
vae, supernovae, thermonuclear shell flashes. X-ray bursts and 7 -ray bursts. 
Here the flow is often supersonic. 
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Most of the observed mass in the universe is in the form of ionized plasma. 
The release of energy in such material gives rise to changing pressure gradients 
and so to hydrodynamic motion. Such fluid motion, in turn, will transport 
ashes and fuel to and from the flame zone. This gives rise to a subtle and 
difficult problem. The degree of difficulty of the problem is closely related 
to the nature of the coupling between the nuclear burning and the hydrody- 
namic flow. Special cases, in which this coupling is simpler, can be treated 
more reliably. First, nuclear reactions release energy in the form of radia- 
tion and kinetic energy of particles; at stellar densities this energy acts to 
heat the medium in which the reactions occur. The heating can modify the 
pressure, and thereby modify the flow. Second, nuclear reactions change the 
composition of the material, which in turn may modify its equation of state 
and its opacity, which in turn may modify the pressure and the heating, and 
again the flow. Conversely, the flow may remove ashes and bring fuel to the 
flame, countering the natural tendency of the flame to exhaust its fuel. The 
composition is specified by a set of abundances Yi for i species of nuclei and 
particles, where it is convenient to define Yi as the ratio of the number of 
particles of type i in some reference volume to the number of nucleons in 
the same volume. Often this coupling is dominated by a few nuclear species, 
and recognition of this gives rise to a set of approximations related to min- 
imum networks and post-processing. The idea is to identify the key burning 
processes {minimum network), calculate them coupled to the hydrodynamic 
flow, then use those conditions to recalculate (post-process) the entire set of 
nuclear processes. Larger networks (i.e.,more processes) require more effort 
to calculate, but if well chosen - give more accurate results. Doubling the 
size of a network without including a key process will simply give a more ex- 
pensive error. For a discussion of minimum networks and their use in stellar 
evolutionary models see Arnett & Thielemann (1985), Thielemann & Arnett 
(1985) and Arnett (1996). 

4.1 Time Scales 

Analysis of the characteristic time scales is crucial for understanding the 
physics of the problem. To begin we consider the order of magnitude of some 
cross sections of interest. 

For two particles of charge Zie and Z 2 C we may define a Coulomb radius 
^Coui from ZiZ 2 €^ /2rcou\ ~ E, where E is an appropriate energy. For 
example, for an atom that energy would be the energy of the bound electron, 
so E — p^/2me- Relating the momentum p to the radius, using the de Broglie 
wavelength, we have a cross section of order (Tatom — i:a^Z\Z 2 ~ 0.88 x 
10~^^Zi.Z’2 cm^, where oq is the radius of the first Bohr orbit. However, at 
temperatures at which nuclear processes can occur, atoms will be ionized. 
Taking E = \kT gives cr coui = 1-01 x 10~^ZiZ2/T^ cm^, which for higher 
temperatures is smaller than the previous estimate. Similarly, if the nucleus 
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hcLS a radius r = 1.4^3 Fermi, this gives a nuclear cross section of anuc = 
7rr2 = 4.5 x cm^. 

These estimates are extremely crude. These quantum mechanical sys- 
tems exhibit resonance behavior so that the cross sections vary enormously. 
Equating a^uc cr coui for typical nuclei of astrophysical interest gives a 

temperature of about 10^^ degrees Kelvin, higher than is generally of interest, 
which suggests that Coulomb scattering of ions is more common than nuclear 
interactions. A more accurate estimate generally gives stronger support for 
this idea. Consider an extreme example: ^^C +^^C at a center of mass energy 
of 3.79 MeV has a measured reaction cross section of a = 3.1 x 10“^^ cm^ 
(Mazarakis & Stephens 1973). This corresponds to a stellar temperature of 
T — 4 X 10^ K, and cTnuc/c^Coui = 1.5 x 10“^. Because this number is small, we 
expect the ions to establish an equilibrium velocity distribution, so that we 
have the great simplification of dealing with the reaction rates for a thermal 
gas. 

At high temperatures (T > 10® K), radiative diffusion is dominated by 
Thomson opacity. Since aThom = f^rr ^ = 0.665 x 10"^^ cm^ , this cross section 
equals acoui at T = (|ZiZ 2 )^/^ x 10^ K. For lower temperatures, typical of 
ignition conditions, heat diffuses faster than composition. The time scale 
for diffusion of heat through a distance Sr is rdiff = (5r)^/Au, where t; is 
the velocity of diffusing objects and A = 1/Na is the mean-free-path. For 
highly degenerate matter, electron conduction is more effective than radiative 
diffusion at transporting energy. The form of the time scale for diffusion of 
composition is identical to Tdiff above, but because the velocities of the nuclei 
are smaller and their Coulomb cross sections higher, this time scale is longer. 
Thus we can concentrate on the ignition of new fuel by thermal heating rather 
than the action of superthermal particles, a significant simplification. 

There are two separate time scales for burning which must be considered. 
The first is the ignition time scale of the fuel which is defined to be the 
temperature e-folding time 

Tt — T jT CvT/^nucj 

where 6nuc is the energy release rate of the nuclear processes, and Cv is 
the specific heat. Because charged-particle reactions are heavily modified by 
Coulomb barrier penetration, this time scale strongly decreases with increas- 
ing temperature. The second important time scale is the burning time^ i.e., the 
time to significantly reduce the abundance of fuel, which is defined as 

ri=XijXi = YilYi, (4.2) 

where X{ is the mass faction of species z, and Y{ is the mole number, obtained 
by dividing the mass fraction by the atomic weight of the species. In simple 
cases, this differs from tt by the ratio of the thermal energy content to the 
Q-value for the reaction (per unit mass burned). For example, for ^^C+^^C 
this ratio is 0.25 Tg, where Tg is temperature in units of 10^ K; this is of 
order unity at explosive temperatures. For comparison, the ratio is about 
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0.005 for hydrostatic hydrogen burning by the CNO cycle. This ratio can 
also approach zero in explosive situations, if the matter is degenerate. In this 
case the specific heat approaches zero, so that consumption of a small amount 
of fuel gives a large change in temperature. 

Finally, two time scales associated with hydrodynamic motion must be 
considered. The first is the time for a region to react to a pressure imbalance, 
which is taken to be the sound travel time 

"^hyd ~ (4*^) 

where Cg is the local sound speed. Finally, there is the time for a convective 
element to move through the region in which convection occurs (obviously an 
oversimplification of a complex process). This convective time scale is defined 
by 

^conv “ ^^conv/'^conv? 

where (Jrconv is the width of the convective zone and Uconv is the typical 
velocity of a convective blob. 

Depending on the relative size of the various time scales, very different 
requirements for the numerical treatment arise. If the nuclear time scales 
Ti are all large compared to Tconvj one might approximate the convective 
zone as uniform in abundances, which slowly evolve on the nuclear time 
scales. If some of the are shorter than Tconvj it is not correct to ignore 
the abundance gradients that this implies. In particular, these gradients may 
interact with the burning to modify the convective flow itself. Silicon burning 
is an interesting case of this: the time scales for a, n, and p, which maintain 
the quasi-equilibrium, are shorter than Tconv, while the for ^®Si is longer. If 
the nuclear time scales are all small compared to Tconv ) the problem simplifies 
again, and each region is loosely coupled to its neighbors. For more dynamic 
problems, such as pulses or explosions, the convenient fiction of steady state 
convection is untenable, and the hydrodynamics must be treated as an equally 
important aspect of the problem. 

4.2 Types of Burning 

Many different types of burning occur in astrophysical problems. The type 
which occurs in a particular problem is determined by the ratios of the time 
scales described above. Burning can range from quiet hydrostatic burning, 
such as occurs in the center of most stars, to the explosive burning which 
can occur, for example, in models of thermonuclear supernova explosions 
(i.e., SNe la). Burning can occur in laminar or turbulent flows, the fluid mo- 
tion being produced either by external causes and/or by the burning itself. 
Hence, the interaction between burning and flow must be considered. 
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4.2.1 Hydrostatic Burning. This is the least violent form of burning and 
occurs in stars that are hydrostatic (in radiative burning zones). In this case, 
the reaction rate is slow enough that energy can be transported from the 
burning region by radiation or conduction as rapidly as it is generated. A 
similar case is a subsonically convective flame region, in which heat and ashes 
are removed by slow convection, and new fuel brought in. In other words, 
n ^ Tdiff or Ti ^ Tconv These are the conditions which holds in stars such 
as the Sun over most of their lifetimes. The star remains in equilibrium, 
quietly burning its fuel, while the energy produced is transported to the 
star’s surface, providing its luminosity. This process will continue until the 
fuel is exhausted, at which time the nuclear reactions can no longer maintain 
sufficient heat, and hence pressure support, to prevent the center of the star 
from contracting. The temperature and density at the center of the star will 
continue to increase until the conditions axe reached at which the reaction 
products of the previous burning ignite. Depending on the situation, this 
burning can be either another hydrostatic phase or an explosive event, 

4.2.2 Detonations. At the other extreme, detonations are the most violent 
form of burning one encounters. Astrophysical detonations generally occur 
only under degenerate conditions. During the initial stages of a thermonuclear 
runaway, before the temperature rises significantly, the reaction rate is still 
relatively small. In nondegenerate matter, the pressure increase produced by 
the reactions will cause the burning region to expand and cool, preventing 
the runaway from proceeding. In other words, tt > T'hyd- On the other hand, 
if the matter is degenerate, the temperature increase created by the burning 
will not create a significant increase in pressure. Thus the temperature will 
continue to increase until the matter becomes nondegenerate. At this point, 
the energy generation rate will be too large for hydrodynamic motion to 
stop it, and an explosion will result. If the resulting shock is sufficiently 
strong to raise the fuel above the ignition temperature, a detonation wave 
will propagate outwards from the point of ignition. 

In its simplest form, detonation wave theory is easily understood (see 
e.g.,Courant & Friedrich 1948; Fickett & Davis 1979). As the shock propa- 
gates into the unburned fuel, it compresses and heats the material beyond 
the ignition point. Immediately behind the shock is the reaction zone, in 
which the fuel burns. In detonation theory the width of the reaction zone 
is frequently neglected, so that the detonation front is treated as a sharp 
discontinuity (Fig. 4.1). In this case jump conditions can be derived for the 
change in the hydrodynamic variables across the front in much the same way 
as is done for a simple shock. Note, however, that in order to obtain the post- 
detonation state one must first choose a detonation velocity, because unlike 
the case for simple shocks, the front velocity is not determined from the jump 
conditions. Moreover, one-dimensional steady detonation theory is oversim- 
plified, as real detonations are observed to have inherent three-dimensional 
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final detonation state 
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Fig. 4.1. Simplest model of 
a detonation wave: The width 
of the reaction zone is ne- 
glected, i.e., the detonation front 
is treated as a sh 2 trp discontinu- 
ity 



structure (cellular detonations) and can propagate in a non-steady manner 
(galloping detonations) (see, e.g.,Fickett & Davis 1979). 

Depending on the value chosen for the detonation speed no, one or two 
solutions exist. In the latter case the two possible solutions correspond to 
strong and weak detonations. A strong detonation propagates at a speed 
slower than the post-detonation sound velocity with respect to the fluid be- 
hind the shock, i.e., D < U 2 +C 2 (the subscript 2 denotes the post-detonation 
state), so that disturbances generated behind the front will eventually catch 
it. Thus this solution is unstable. Weak detonations, which propagate faster 
than the post-detonation sound velocity with respect to the fluid behind the 
shock, i.e.jjD > U 2 -f C 2 , are generally considered to be unphysical except 
under certain special conditions (Fickett & Davis 1979). 

The detonation which usually occurs in nature is the one corresponding to 
the propagation speed at which only one solution exists. This unique solution 
is called a Chapman- Jouguet detonation. It propagates with a speed equal to 
the sum of the post-detonation fluid velocity and sound speed, i.e., 

Dcj=W2+C2. (4.5) 

By assuming that the front propagates with this velocity, the post-detonation 
state is completely determined. Both the pressure and the density increase 
across the front, while in the frame in which the shock is stationary, the fluid 
velocity decreases. 

Consider a detonation in a rigid tube followed by a piston with a fixed 
velocity. Two types of detonations which have somewhat different profiles 
can occur. If the velocity of the piston which generates the detonation is 
larger than the fluid velocity of the Chapman- Jouguet state, the result is 
an overdriven detonation. The structure in this case consists of the shock 
and reaction zone, followed by a constant state corresponding to the point 
at which all the fuel is depleted. This is a strong detonation, and therefore, 
the constant state behind the front does not correspond to the Chapman- 
Jouguet state. For smaller piston velocities, the detonation is unsupported. 
For this type of detonation, the reaction zone and the constant state behind 
the front are connected by a rarefaction, called the Taylor wave. Although 
this type of detonation propagates at the Chapman- Jouguet velocity, the 
Chapman- Jouguet state does not correspond to the constant state behind 
the front, but instead is the state at the top of the rarefaction. 
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Fig. 4.2. ZND detonation wave: 

The shock, which is taken to be 

' ' infinitely thin, is followed by a 

reaction zone reaction zone of finite width 



Although the above theory provides a satisfactory explanation for simple 
detonations, it has significant limitations because of the assumptions that 
the reaction rate is infinite and that the reaction zone has zero width. A 
slightly more complex treatment is used in the Zeldovich-von Neumann- 
Doering (ZND) model (Zeldovich 1940; von Neumann 1942; Doering 1943). 
This theory assumes that the shock, which is taken to be infinitely thin, is 
followed by a reaction zone of finite width (Fig. 4.2). The final state obtained 
after the fuel is completely burned is exactly the same as for the simpler 
theory described above. Additional complications in the theory arise due to 
multidimensional effects, such as cellular detonations and spinning detona- 
tions. However, these are beyond the scope of this discussion (e.g.,Fickett & 
Davis 1979). 

4.2.3 Deflagrations. Deflagrations represent a much less violent form of 
burning than detonations, but in many ways are more complex (e.g., Wilson 
1985; Zeldovich et al. 1985). They result when the burning is unable to pro- 
duce sufficient overpressure to create a shock which is strong enough to ignite 
the fuel. The motion of the front is usually very subsonic. Burning is initiated 
by the diffusive transfer of heat from the hot ashes behind the front into the 
cold fuel. Although, for the case of a thin firont, deflagrations must obey the 
same jump conditions as detonations, the propagation velocity now depends 
on the rate of heat transfer. 

Compared to detonations another major difference is that the pressure 
and density decrease behind the deflagration front, and in the reference frame 
in which the front is stationary, the velocity increases. In the case of a spher- 
ically symmetric deflagration, which begins at the origin, the velocity behind 
the front must eventually become zero to satisfy the boundary condition at 
the origin. The only way in which this can happen is if the deflagration is 
preceded by a compression wave which accelerates the material away from 
the front. The passage of the deflagration will then provide exactly the cor- 
rect jump in velocity so that the material behind the front will come to rest. 
This can happen since the deflagration velocity is subsonic, and therefore, 
there is communication between the origin and the rest of the flow by sound 



waves. 
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In the simplest theory deflagrations are described as flow discontinuities, 
which propagate subsonically (e.g.,Courant & Friedrich 1948; Landau & Lif- 
shitz 1959). If the flow is laminar, a physically more accurate description 
is that of a conductive or diffusive laminar flame, whose width and speed 
is determined by the rate of energy generation and the transport of energy 
and composition. In most situations, however, even this description is too 
simple, because deflagrations or subsonic flames are subject to a variety of 
instabilities, which can significantly influence their shape (wrinkling of the 
flame) and propagation speed. Moreover, because the propagation speed of 
the flame is subsonic, flame and flow do interact. In particular, if burning 
occurs in a convective or turbulent environment, depending on the relevant 
length scales and time scales, physically quite different combustion regimes 
must be considered (see next subsection). 

Unlike the case of Chaprnan-Jouguet detonations, where it is possible 
to compute the exact propagation velocity, the propagation speed of defla- 
grations can only be estimated from dimensional reasoning (Landau & Lif- 
shitz 1959; Buchler et ai. 1979; Fryxell & Woosley 1982; Nozakura et ai. 1984; 
Woosley 1986). For the simplest case of a laminar front which propagates as a 
result of radiative diffusion or conduction, it is fairly easy to obtain an order 
of magnitude estimate for the velocity of the wave. The width of the defla- 
gration can be approximated by setting the diffusion time scale Tdiff equal to 
the burning time scale Ti. Thus the width of the front is given by 

S ^ y/XcTi , (4.6) 

where A and c are the mean free path of photons or electrons and the speed 
of light, respectively. The (laminar) velocity of the deflagration can then be 
estimated as 

£>Iam ~ ~ \/A^ . (4.7) 

n 

Using an implicit radiation hydrodynamics code, Timmes & Woosley 
(1992) have determined the propagation velocity, the width, and the den- 
sity structure of conductive deflagrations in degenerate carbon-oxygen and 
oxygen-neon-magnesium compositions under conditions typical of SN la pro- 
genitors {g ~ lO^^gcm”^ and T ~ 2 x 10^ K). At these conditions, one ex- 
pects from Eq. (4.6) that for both compositions the flame width S is of the 
order of 10“^ cm. This implies a flame speed of a few lOkms"^ In their 
numerical experiments, Timmes & Woosley (1992) constructed spheres with 
majsses between 10 g and 100 g using extremely fine zoning (~ 10~® cm). The 
spheres initially had constant temperature and density. The innermost 10 
zones of a sphere were then artificially incinerated, giving (after a short ini- 
tial transient) rise to a propagating isobaric, laminar, conductive flame, whose 
steady state physical properties were determined. Timmes & Woosley (1992) 
found that the laminar flame speed in a degenerate carbon-oxygen mixture 
is given with about 10% accuracy by the expression 
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(4.8) 



in the density range lO’^gcm"^ < Q < lO^^ gcm”^, which is consistent with 
the earlier result of Woosley (1986). 

As the laminar flame speed in a degenerate carbon-oxygen mixture is very 
subsonic (Diam ^ 10"^ Cs), it is generally agreed that it is too slow to produce 
an explosion with the observed properties (see Sect. 3.1). However, in SNela 
thermonuclear burning takes place in an environment in which gravity plays 
a role and drives Rayleigh-Taylor instabilities (Muller & Arnett 1982, 1986; 
see also Sect. 3.4). In addition, the flame is subject to the Landau-Darrieus 
instability (Landau 1944; Darrieus 1938) and to the Kelvin-Helmholtz in- 
stability (see, e.g., Landau & Lifshitz 1959). Hence, as the Reynolds number 
of the flow Re ~ 10^^ in SNela, turbulence develops and the flame front 
becomes wrinkled. The average rate of propagation is then no longer deter- 
mined by the laminar speed, but by turbulence. As a result the velocity of 
the front is increased compared to that of a diffusively propagated deflagra- 
tion (Dturb > -Diam) by a larger heat transfer rate in the distributed burning 
regime or by an increase in the surface area of the front in the flamelet burn- 
ing regime (e.g., Kerstein 1988; see next subsection for a definition of these 
burning regimes). 

Whether the acceleration of the flame by turbulence and/or intrinsic in- 
stabilities is sufficient to explain the observations, is one of the, if not the 
single most important question in SNela modeling. In order to answer this 
question, one has to consider turbulent combustion in a self-gravitating fluid 
(see next subsection). 

4.2.4 Turbulent Combustion. When burning occurs in a turbulent flow 
(partially or not at all caused by the burning itself), the combustion process 
can no longer be modeled by a plane laminar conductive deflagration. In- 
stead, a much more complicated type of burning must be considered, namely 
turbulent combustion, which is an extremely challenging and still unsolved 
problem (e.g., Wilson 1985; Zeldovich et al. 1985; Pope 1987; Clavin 1994). A 
detailed and comprehensive discussion of this complicated flow phenomenon 
is far beyond the scope of my lecture and being no expert in the field would 
also surmount my scientific competence. Thus, I have restricted myself to 
describe some aspects of turbulent combustion, which are of relevance for 
thermonuclear supernovae (i.e., SNela). Recently, both Niemeyer & Woosley 
(1997) and Niemeyer & Kerstein (1997) have carefully addressed this prob- 
lem. The subsequent discussion closely follows their analyses. 

Turbulent combustion in Chandrasekhar mass models for Type la super- 
novae can occur in two distinct burning regimes (Niemeyer & Woosley 1997). 
If the laminar flame structure is unaffected by turbulence on small scales 
(comparable to the flame thickness), turbulent combustion proceeds in the 
flamelet regime (e.g., Clavin 1994). Burning in the flamelet regime is char- 
acterized by a passively advected flame surface, which propagates through 
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Fig. 4.3. Turbulent combustion in the flamelet regime. The thin ’white surface 
marks the wrinkled and non-contiguous laminar flame, which separates fuel (dark 
regions) from ashes (grey regions). The size of the convective eddies is larger than 
the width of the laminar flame and fuel consumption occurs in an extended turbu- 
lent flame brush behind the flame. (Plot by J. Niemeyer) 



the medium at the laminar flame speed the large-scale structure of the 

flame surface being determined by turbulence, only (i.e., independent of mi- 
crophysics). Fuel consumption takes place in an extended region behind the 
fuel-ash boundary in the so-called turbulent flame-brush (e.g.,Pope 1987; 
Fig. 4.3). In Chandrasekhar mass models for SNela burning occurs in the 
flamelet regime during the early phase of the explosion when the flame is 
thin and fast (Timmes & Woosley 1992), and turbulence is weak. 

During a SN la explosion the laminar flame becomes thicker and slower 
as it proceeds to lower densities, while turbulence becomes stronger due to 
the nonlinear growth of Rayleigh-Taylor instabilities of the flame surface (see 
below). Eventually, the conditions for the flamelet regime are no longer satis- 
fied, and burning enters the distributed or stirred burning regime (e.g.,Pope 
1987). Turbulent eddies enter the flame and disrupt it. Turbulent transport 
dominates over conductive transport even on microscopic scales (Fig. 4.4). 

The transition from flamelet to distributed burning is well known in the 
combustion literature, but has only recently caught the attention of astro- 
physicists (e.g., Khokhlov et aJ. 1997; Niemeyer & Woosley 1997). Commonly, 
the Karlovitz number Ka is used to characterize the transition. This dimen- 
sionless number, which is of order unity at the transition, is either defined 
as a ratio of the reaction and flow length scales (Lewis & von Elbe 1961) 
or as a ratio of the reaction and flow time scales (Bradley 1993). Both defi- 
nitions are equivalent for gaseous combustion provided the Prandtl number 
Pr= v/k is about unity, where v is the kinematic viscosity and k is a ther- 
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Fig. 4.4. Turbulent combustion in the distributed regime. Convective eddies pen- 
etrate the flame (white to grey layer), which is no longer well defined. Fuel (blax:k) 
is consumed within turbulent eddies, which dominate over conductive transport. 
(Plot by J. Niemeyer) 



mal diffusion coefficient. Recently, it has been pointed out by Niemeyer & 
Kerstein (1997) that in SNela the Prandtl number is not of the order unity, 
but ranges from 10“^ to 10“^ (Nandkumar & Pethick 1984). Hence, a novel 
criterion for the transition to distributed turbulent combustion is required 
in degenerate white dwaxfs. Niemeyer & Kerstein (1997) propose to use the 
eddy diffusivity, evaluated at the scale of the flame thickness as the governing 
parameter; if it exceeds the microscopic thermal conductivity, fiamelets are 
considered to break up. In Chandrasekhar mass models of SNela the density 
^tr at which the transition occurs is of the order 10^ g cm“^. The actual value 
depends on the size of the turbulent velocity fluctuations Uturb and varies 
from ^tr « 5 X 10®gcm“^ for uturb ^ 10® cm/s to ^tr 3 x 10*^gcm“^ for 
^turb ^ 10® cm/s. These transition densities are roughly a factor of 5 smaller 
than those determined previously by Niemeyer & Woosley (1997), which has 
important implications for the so-called delayed-detonation scenario of SNe la 
(Khokhlov 1991; Woosley &: Weaver 1994). 

Let us now consider the flame instabilities (see previous subsection) , which 
generate the turbulence. In SNela these instabilities act on vastly differ- 
ent length scales. They modify the flame structure on scales ranging from 
about 10^ cm, which is a significant fraction of the white dwarf ^s radius 
(i^vvD ~ 10® cm), down to microscopic scales of the order 10“®cm, which 
are determined by the size of the laminar flame thickness and the viscous 
dissipation-scale. 
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The Rayleigh-Taylor (RT) instability dominates the dynamics of the 
flame on scales 10® cm ^ /rt ^ 10^ cm. It arises, because the density de- 
creases and the temperature increases behind a subsonic flame (deflagration). 
When the flame propagates outwards from the center of a star against the 
force of gravity, one has a low density bubble of hot ashes trying to push out- 
ward through the denser and colder unburned fuel. This arrangement is un- 
stable, similar to the case of a classical RT instability (see Sect. 3.4). The evo- 
lution of the RT instability is characterized by a short period of exponential 
growth during which the perturbation amplitudes become comparable to its 
wavelength, and then by the nonlinear stage where interaction among modes 
leads to a merging and fragmentation of structures (e.g.,Read 1984; Young 
1984). The shear flow occurring at the surfaces of rising bubbles and sinking 
spikes produces Kelvin-Helmholtz (KH) unstable regions along these surfaces 
(Niemeyer & Hillebrandt 1995a). Eventually, a turbulent mixing layer forms 
(Snider & Andrews 1994). The growth of the layer can be estimated by the 
Sharp- Wheeler model (Sharp 1984). In this one-dimensional model the non- 
linear stage of a RT unstable front is described by statistical means. Such 
an approach is justified, because the RT unstable front is characterized by a 
large number of structures with various scales. Numerical simulations based 
on the Sharp-Wheeler model (Glimm & Li 1988) show that the RT mixing 
region asymptotically grows at a rate of approximately 

/rt 0.05 , (4.9) 

where ^ At g the effective gravitational acceleration and At = (^>2 — 
Q\)I{Q 2 4- ^>i) is the Atwood number (see Sect. 3.4). This result agrees well 
with experiments (Read 1984) and full hydrodynamic simulations (Young 
1984). According to Eq. (4.9), a RT unstable front propagates with a velocity 



Urt O.l^eff ^ OC \/^eflf^RT • (4-10) 

The RT structures contain most of the overall kinetic energy of the ex- 
ploding star and evolve significantly during a SNIa explosion, i.e.,they are 
very anisotropic and non-stationary. Hence, burning and hydrodynamic flow 
occurring on the RT unstable scales can probably not be described by a 
simple parameterized model, but must be simulated directly (Niemeyer & 
Hillebrandt 1995a). 

The joint action of the large-scale RT and KH instabilities feeds a tur- 
bulent cascade (Richardson 1922), which extends all the way down to the 
Kolmogorov length 7 ?, at which scale dissipation due to microscopic viscosity 
becomes important. This scale is commonly called the dissipation scale. It 
can be estimated within the framework of Kolmogorov’s (1941) turbulence 
model, which assumes homogeneous, isotropic and incompressible turbulence. 
One finds t} = /rtRc”^^'^ (Landau & Lifshitz 1959). For Reynolds numbers 
Re « 10^^, which are typical of white dwarfs, g is of the order 10“^ cm, only. 
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In the intermediate range, i.e.,for scales with r/ < / /rt? the hydro- 
dynamic behaviour is dominated by the inertia of the fluid. In this so-called 
inertial range the famous Kolmogorov scaling law for incompressible turbu- 
lent velocity fluctuations holds (Kolmogorov 1941; Landau & Lifshitz 1959) 
provided two assumptions are fulfilled. Firstly, the properties of the fluid in 
this range must be independent of both the microscopic viscosity and the 
large-scale geometry. Secondly, the transport of kinetic energy from large to 
small scales is completely determined by its constant mean energy dissipa- 
tion value £. The scaling law, obtained under these two assumptions from 
dimensional analysis, is 

e oc u{l)^fl , (4.11) 

and hence the velocity fluctuations u{l) must scale consistently as 

u{l) oc . (4.12) 

As pointed out by Niemeyer & Kerstein (1997) turbulence generated by 
the Rayleigh-Taylor instability creates parallel cascades of kinetic and po- 
tential energy. The former cascade resembles that of Kolmogorov turbulence, 
where large eddies fragment into smaller ones. The potential energy cascade, 
which is only present in RT generated turbulence, is produced by the disrup- 
tion of buoyant regions of burned, hot matter leading to an additional flux 
of density fluctuations from large to small scales. This density cascade may 
dominate over the eddy cascade (L‘vov Sz Falkovitch 1992) and gives rise to 
scaling laws for the density and velocity fluctuations given by (Bolgiano 1959; 



Obukhov 1959) 




q{1) oc 


(4.13) 


and 




u{l) oc. 


(4.14) 



respectively. Experimental and analytical evidence in support of and against 
the above Bolgiano-Obukhov scaling can be found in Borue & Orzag (1997). 

Another length scale of importance in turbulent combustion is the Gibson 
scale /cibs introduced in the context of turbulent flame modeling by Peters 
(1988). At scales below the Gibson scale hydrodynamic effects on the flame 
are negligible, because the slow, small turbulent eddies present at those scales 
are burned before they can alter the flame structure significantly. The Gibson 
scale is defined by the relation 

u(/Gibs) “ ^lam • (^’l^) 

On microscopic scales the flame is intrinsically unstable due to the 
Landau-Darrieus (LD) instability. For the case of SNe la, this instability has 
been explored in detail both analytically and numerically by Blinnikov & 
Sasarov (1996) and Bychkov & Liberman (1995), and its onset was simu- 
lated (in 2D) by Niemeyer & Hillebrandt (1995b). These studies show that 
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on scales larger than the Markstein, or critical length lent ~ 100(5 (Mark- 
stein 1951), flames propagating towards the surface of the white dwarf are 
unstable to the LD instability on all wavelengths. In the nonlinear regime the 
LD instability leads to the formation of cusps at the points where the flame 
front self-intersects. This gives rise to a quadratic damping term (Zeldovich 
et aL 1985), which stabilizes the flame surface once cells with a stationary, 
scale-independent amplitude form. The speed of the cellular surface is de- 
termined by the increased surface area (e.g.,Niemeyer & Woosley 1997), In 
the absence of gravity, there is no known upper limit for the largest scale of 
cell formation for a cellular flame front driven purely by the LD instability. 
This is no longer true, when the effect of gravity is taken into account. Then 
the cell structure becomes scale dependent and the nonlinear stabilization of 
the flame becomes ineffective (Khokhlov 1995). In a turbulent environment 
cell formation ceases on the scale I where the turbulent velocity fluctuations 
u{l) become comparable to the speed of the cellular front Uceii(0- The eddy 
turnover time is then comparable to its interaction time with the front, or in 
other words, turbulent perturbations of the front are comparable to the cell 
size. Hence, the transition between cellular and turbulent burning occurs at 
the respective Gibson scale defined as u(/Gibs) = ^ cell (^ Gibs)* 

This concludes the discussion of the instabilities operational in SNela. 
However, in order to describe the evolution of thermonuclear burning in 
SNela, we need to address a final point. How is the effective speed of the 
burning front in the turbulent regime (Dturb) related to the turbulent veloc- 
ity (u; e.g.,Eq.4.14). This is a very important and non-trivial question, as 
the two velocities refer to a priori different physical processes. If the turbu- 
lent velocities are large (i.e.,u ^ D\^rn), one commonly assumes Dturb = u. 
Hence, one assumes that the effective burning velocity scales with the tur- 
bulent velocity. This scaling assumption is confirmed, e.g.,by Niemeyer & 
Kerstein (1997). 

From the discussion above the following picture for the propagation of 
thermonuclear burning in Chandrasekhar mass models of SNela is suggested. 
(Niemeyer & Woosley 1997). Initially, burning propagates in form of a lami- 
nar conductive flame of thickness S with speed Dfront = L)iam (see Eqs. (4.7) 
and (4.8)), until the smallest cells appear due to the Landau-Darrieus in- 
stability at /crit ~ 100(5. In this so-called cellular regime the propagation 
speed of the front is no longer scale independent, but approximately scales 
as Dfront(0 (Blinnikov & Sasarov 1996; Niemeyer & Woosley 1997). 

At the Gibson length /cibs (Eq. (4.15)) the turbulent velocities generated 
by the (macroscopic) gravity driven RT instability become comparable to 
the cellular flame speed. Nonlinear cellular stabilizations fails, and burning 
changes from the cellular regime to the (inertial) Kolmogorov regime. Un- 
der the scaling assumption Dturb = w, this implies that for scales I > /cibs 
the turbulent flame speed is determined by the eddy turnover time on ev- 
ery scale. Hence, Dfront(0 oc (Eq. (4.12)) in case of Kolmogorov scaling, 
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or Z^front(0 ^ (4’14)) if Bolgiano-Obukhov scaling applies. Finally, 

above /turb/RT 10^ cm, which marks the boundary between the turbulent 
burning regime and the buoyancy-driven RT regime (Niemeyer & Woosley 
1997), burning propagates according to Dfront(0 (Eq. (4.10)). If tur- 

bulence obeys Bolgiano-Obukhov scaling, the division into inertial and RT 
range becomes obsolete, because buoyancy effects are already taken into ac- 
count in that scaling. The actual values of the length scales at which the 
various transitions occur depend on density. Finally note that the above de- 
scription of the evolution changes, if burning enters the distributed burning 
regime at some stage. 

4.3 Nuclear Reaction Networks 

Consider an element of matter at sufficiently high temperature and density so 
that thermonuclear reactions occur. In astrophysical situations a number of 
different types of reactions can be important. For example, consider reactions 
of the sort X(p,7)F, or equivalently X -h p F -h 7, where X is the target 
nucleus, Y the product nucleus, p the bombarding particle (a proton) and 7 
an exit channel particle (a gamma ray). The reactions which one must deal 
with most often are 

(p,t) (p,n) 

{p,a) (n,7) 

(n,a) (a, 7) 

and their inverses 

(7,p) (n,p) 

(a,p) (7,n) 

(a,n) (7, a) 

although several other types of reactions may be important as well. During 
hydrogen burning, reactions involving and ^He must be considered. The 
“heavy ion” reactions -h and dominate carbon 

and oxygen burning stages; the more important exit channels involve emis- 
sion of (1) a proton, (2) an alpha, (3) a neutron and (4) a deuteron (for oxygen 
burning). The most important three body reaction is S^^He) ^^C + 7. The 
slower weak interactions are of fundamental importance because they alone 
change the ratio of neutrons to protons in the system. 

Consider the rate of change in the number density rii of species z. The 
nonlinear differential equation which describes the behavior of rii may be 
written (e.g., Arnett 1996) 

hi = ^ a{j, k) < jk > njUk 

j j,k 

+ ^ Ci{j, k, 1) < jkl > UjUkTii . 



(4.16) 
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The first term in Eq. (4.16) describes changes of rii due to ^-decays, electron 
captures (EC) and photo-disintegrations, the second term describes changes 
due to two-body reactions, and finally, the third term describes changes due 
to three-body reactions. Accordingly Xj is the weak interaction rate (/1-decay, 
EC) or the photo-disintegration rate, and < jk > and < jkl > denote the 
thermally averaged products of cross section and relative velocity in the cen- 
ter of mass system for the two-body and three-body reactions, respectively 
(e.g., Fowler et ai. 1975). The coefficients ci are short notations for 



CiU) = 

Ciijyk) = 



Ci{j,k,l) = 



±Ni, 

Ni 

Nj\Nk'.Ni\ ’ 



(4.17) 



where Nj,Nk,Ni are absolute numbers indicating how many particles of type 
z, j, A:, I are involved in the reaction and the Nj \ etc., prevent double counting 
of reactions with identical particles. The sign depends upon whether the 
nucleus i is produced (+) or destroyed (-) in the reaction. 

If an expansion or contraction occurs, as is generally the case in astrophys- 
ical applications, Eq. (4.16) must be modified. In order to avoid inconvenient 
and perpetual bookkeeping, it is useful to consider the number of species i 
per unit of some conserved quantity, instead of per unit volume. Nucleon 
number is such a quantity, being conserved even in the relativistic limit. 

Avogadro’s number N\ is defined as the number of atoms of some species 
i which makes Wi grams, where Wi is the atomic weight of species i. For 
helium, Wue is 4.0026 for example. If rrii is the mass of a nucleus of species 
i as measured in the lab, then for a mixture consisting only of species i, 
the mass density is qq — rii rui and N a — Wi. For a mixture of species, 

^0 ” XJiTli TTii. 

This quantity changes with composition and is not relativistically invari- 
ant. However the related quantity 



A-i 



(4.18) 



avoids these difficulties. We have replaced the mass m* by A{ (the atomic 
number) times the atomic mass unit. Under laboratory conditions q almost 
equals We now define the nucleon number fraction for species i to be 



Xi = 



7li 



(4.19) 



which is just the fraction of the nucleons tied up in the form of particles of 
species z. In the astrophysical literature this and a similar quantity involving 
^ 0 , not g, are both called the mass fraction. From Eqs. (4.18) and (4.19), 



- 1 . 



(4.20) 
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A related quantity is 



Xi Tli 



(4.21) 



which is a ratio of the number of nuclei of species i to the total number of 
nucleons in the system. Now, the total number of nucleons is 



n = Eifii Ai , 



(4.22) 



so 

i7.y. = ^ , (4.23) 

n 

which is not equal to unity, in general. In actual calculations the distinction 
between g and is usually ignored. Using Eq. (4.21), Eq. (4.16) becomes 

Yi = + E < jk > YjYk 

j 

+ E < jkl > YjYkY, . (4.24) 

j.k.l 

Unlike the number density n^, the specific abundance Yi does not change 
when only an expansion or contraction occurs, but requires the action of 
nuclear processes or mixing. Thus we have a set of coupled, nonlinear ordinary 
differential equations. Such a system of equations is said to define a nuclear 
reaction network. 



4.3.1 Explicit and Implicit Solution Methods. The reaction network 
equations often behave as stiff differential equations (e.g., Gear 1971). In 
general they have rate terms which contain second and higher powers of 
the abundances being sought. Thus they are nonlinear. Only numerical tech- 
niques can provide solutions to a general set of reaction network equations. 
Consider the system of equations 

^ = fi{Q,T,Yu...,YM), i = (4.25) 

where fi{g, T, Yi , > 2 , ^3 • • •) is the right hand side of Eq. (4.24) and N is the 
number of species considered in the reaction network. Suppose one approxi- 
mates the time derivatives by finite differences 



m ^ Yi{t-hSt)-Yijt) 
dt St 



(4.26) 



This implicitly assumes the change in variables to be small during the time 
interval St. Two simple choices for evaluating fi are apparent: 

Yi{t + St)-Yi{t) 



St 



or 



= m 

= fiit + St), 



(4.27) 

(4.28) 
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i.e., , one can evaluate the reaction rate terms at time or at time t St. 
Both are accurate to first order in St] Eq. (4.27) results from a Taylor expan- 
sion from time while Eq. (4.28) results from a similar expansion about time 
t 4- St. One might also use some average of the two, or estimate fi{t + St/2), 
and arrive at an expression accurate to second order in St. One must distin- 
guish between accuracy and stability] both are needed for a correct solution. 
Because one approximates a derivative using a truncated Taylor expansion, 
the resulting difference equation may have extra solutions in addition to those 
of the differential equation (e.g., Potter 1973). One could say that the result- 
ing numerical garbage is due to numerical instability and consider it to be a 
mathematical problem, or regard the difference equations to be incorrectly 
formulated to represent the physics involved. The latter point of view is rarer, 
but often of more practical value. 

With the choice denoted by Eq. (4.27), called forward or explicit differ- 
encing, one has N uncoupled equations for N unknowns. All Yi, density and 
temperature at t St may be found individually in terms of the Yi, den- 
sity and temperature at time t. The latter quantities are just those one starts 
with. Although the solution of these equations is truly elementary, difficulties 
are encountered near steady state and equilibrium (e.g., Arnett 1996). 

The approximation implied by Eq. (4.28) is called the backwards differ- 
encing, implicit differencing or implicit difference form. In this case one has 
to solve a coupled, nonlinear system of equations by matrix manipulation in 
order to obtain the abundances at the new time step. A further complication 
arises when the network has to be solved for time-dependent density and 
temperature (see Sect. 4.4.2). 



4.3.2 Linearization and Matrix Inversion. Let us consider the solution 
of the network equations at a constant density and temperature . If one uses 
implicit differencing, the fi{t-\-Stys contain terms which are at le 2 LSt quadratic 
in the unknowns. Products of the sort Yj{t-\-St) Yk{t-\-St) give a nonlinearity, 
which means that no simple general solution can be obtained. However, in 
introducing finite differences to replace time derivatives in Eq. (4.26), one had 
to assume that 



dt 



St < Yi{t) 



Denote 



or 



Yiit^St) . 



(4.29) 



Aj = Yj(t + 6t)-Y,{t) . 

Then, 

Yj{t + St)Yt{t + 6t) = [Yj{t)+Aj][Yk{t) + Ak] 

« Yj{t)Ak + Yk{t)Aj + Yj(t)Yk{t) 

« Yj{t + 5t) n (t) + y, (t) Yk{t + 6t) 



(4.30) 



(4.31) 
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if one neglects terms above first order in Aj and Ak- Because this approxi- 
mation is only that which would be required for reasonable accuracy of nu- 
merical integration, the process of linearizing the reaction network equations 
does not really introduce any significant new restrictions. Essentially, one 
moves forward in time by analytic continuation; at each new time interval 
one updates the nonlinear coefficients, so that one obtains the nonlinear solu- 
tions. For three-body reactions, this linearization can be more restrictive, but 
predictor-corrector schemes or multistep iterations can solve these problems. 
Now upon linearization, one obtains 

Ai = St{ Zj Ciij)\j{Aj + Y^)+ (4.32) 

k) qNa < jk > {YjAk + YkAj + YjYk) + 

Y.j,k,i 0 < jkl > 

{YiYkAi + YjYiAk + YkYiAj + YjYkY)] , 

where all the abundances are evaluated at time t. 

Such an equation, linear in the unknowns (here the ^i’s), may be written 
for all N species. The equation for a species i contains a term in Ak for each 
species k which is connected to i by a reaction. Obviously the equations for 
“mediating” particles such as p, n and alphas have many terms. One may 
write all these equations succinctly in matrix form: 

AA = B , (4.33) 

where Ais a> column matrix in the A^s^ B a, column matrix in constants, and 
A a square matrix (of dimension iV). To better understand the structure of 
A, consider a system in which there is only an alpha particle chain; that is 

C{a, 7) (a, 7 ) E{a, 7) F{a, 7) G{a, . (4.34) 

If the order is alphas first, followed by six nuclei, the structure is illustrated 
visually as follows: 

/I 1 1 1 1 1 1\ 

1110000’ 

11110 0 0 

A= 1 0 1 1 1 0 0 . ( 4 . 35 ) 

10 0 1110 
1 0 0 0 1 1 1 

\1 0 0 0 0 1 1 / 

Here the I’s denote that the respective matrix element is nonzero. Because 
of the coupling of the alphas to all the nuclei, the matrix is not tridiagonal 
(i.e., all elements zero except those on the diagonal and their nearest neigh- 
bors). The fast solution schemes for tridiagonal matrices are not directly 
applicable (although the structure of the matrix can be used to assist solu- 
tion). Note that as the dimensions increase for a matrix with this or many 
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similar structures, the fraction of zero elements increases also, resulting in 
a sparse matrix. Because the time needed to solve a general matrix equa- 
tion such as Eq. (4.33) increases rapidly with the dimension, it is best to use 
as few reactions in a network as physically plausible. This method was first 
used in astrophysics for silicon burning (Truran et ai. 1967); and presented in 
detail by Arnett & Truran (1969). Wagoner (1969) used a second-order ver- 
sion for Big Bang nucleosynthesis. Using standard methods for solving linear 
equations, one calculates 

A = BA~^. (4.36) 

As the new abundances are Yi{t -(- 6t) — Yiit) + for each species i, one 
has advanced one time step St. Examining the fractional change Ai/Yi and 
other criteria such as change in density, temperature or the size of the last 
time step, a new time step is chosen. New temperatures and densities are 
calculated (if they were not included in the matrix solution) , and new reaction 
rates obtained. The new matrices A and B are constructed, and solved, and 
so on around the cycle until the evolution of the abundances has gone on as 
far as desired. 



4.3.3 Iterative Solution. Instead of linearizing Eq. (4.24) as explained in 
the previous section, one can also solve the network by Newton’s method. This 
of course requires the calculation of the Jacobian of the nonlinear system. For 
constant density and temperature, Eq. (4.24) can be rewritten in vector form 
as 

9 - f = 0 , (4.37) 

where g^f and Y are column vectors (of dimension N), and where the su- 
perscript on t denotes the time step number. According to Newton’s method, 
Eq. (4.37) leads to 

= [y'(i"+')] , (4.38) 

which is a system of linear equations for the unknown (column) vector 

^i+l ^ y(+l(jn+l) _ (4 39^ 



is the Jacobian of the l-th iteration step and is given by 



St ay/(t"+i) 



(4.40) 



Using an initial guess Y®(P“‘”^) = and an appropriate St, Eq. (4.38) 

will typically converge to a relative accuracy of 10“® after about I = 3 iter- 
ations. The linearization of the network discussed in the previous section is 
the equivalent of performing just one Newton iteration. However, the size of 
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the time step must then be limited such that the omitted higher order terms 
are negligible (Arnett & Truran 1969). 



4.4 Coupling Reaction Networks and Hydrodynamics 

If a nuclear reaction network is used within a hydrodynamic code, various 
interdependencies of the rate equations and the hydrodynamic equations ex- 
ist. 

4.4.1 Local Coupling. First, there is a (spatially local) coupling of both 
subsets of equations via the energy release (or consumption) due to nuclear 
burning. The reactions lead to an increase (or decrease) of the temperature 
and consequently of the pressure described by the source term in the energy 
or entropy equation, which in turn leads to an increase (or decrease) of the 
reaction rates. Since the reaction rates are extremely sensitive to temperature 
variations, a very strong nonlinear coupling results. The change of the internal 
energy, s, due to this coupling is given by (e.g., Muller 1986) 

^ = -9.644 X 10^^ Q Ami(? [ergcm"^s“^] , (4.41) 

i 

where Arrii = {rrii — Aimu)c^ and mu are the mass excess in MeV and the 
atomic mass unit, respectively. 

In stellar evolution calculations the standard procedure is to solve the net- 
work separately from the other evolution equations, i.e.,Eq. (4.24) is solved 
for given density, temperature and abundances obtained previously in the 
time step. The resulting energy release and abundance changes are taken 
into account in the next time step (e.g., Weaver et al 1978). 

This type of coupling (or better decoupling) of network and hydrodynamic 
equations can lead to instabilities when calculating nuclear transmutations 
while entering or leaving the regime of nuclear statistical equilibrium (NSE; 
for an explanation of NSE, see e.g., Clayton 1968). Therefore, Muller (1986) 
proposed the following method, which avoids these instabilities. To illustrate 
this method, let us write the network equations and the source term (for a 
fixed density) as 





..,Yn,T) = 0, 


(4.42) 




■ , Yn,T) = 0 , 


(4.43) 



where i = 1 , , . N, j ~ 1 . , . N, and N is the number of species. / and h are 
nonlinear functions of the arguments. If Eqs. (4.42) and (4.43) are discretized 
implicitly (see Eq. (4.28)), the variables from time level are advanced to 
time level 5t according to 



(4.44) 
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^n+l _ . . . , ) - 0 . 



(4.45) 



As the energy itself is some known function of Yi and T (via the equation 
of state), Eqs. (4.44) and (4.45) are a nonlinear system of equations for the 
N Yl unknown variables . . . , and Contrary to previous 

methods, in Muller’s (1986) approach the whole system is solved with the 
Newton technique, i.e., abundances and temperature are updated together. 
The resulting method is very robust and allows one to integrate the network 
equations into and out of the regime of nuclear statistical equilibrium without 
numerical problems. 



4.4.2 Spatial Coupling. A second type of (spatially non-local) coupling 
arises, because the composition as well as the density and the temperature 
are modified by the hydrodynamics and in particular by diffusion, convective 
mixing and by advection in the case of an Eulerian hydrodynamics code. In 
the following, the latter two cases are considered in more detail 

The conservation form for the equations of Eulerian hydrodynamics in 
two-dimensional Cartesian geometry with nuclear burning can be expressed 
by the vector equation (see also lecture by Le Veque) 



au dF(U) dG(U) _ 

The vectors U, F, G, and H are defined by 



(4.46) 
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(4.47) 



(4.48) 



where u and v are the x- and y-component of the fluid velocity Vy E is the 
total energy per unit mass, Xi is the mass fraction of species i, and P is the 
pressure. The quantities e and Xi represent the time rate of change of the 
specific internal energy e — ej g and mass fractions resulting from nuclear 
reactions. 

Eqs. (4.46), (4.47) and (4.48) show that the change of the abundances 
(source term gXi) is coupled spatially via the flow. Formulated differently, 
the reaction network now involves a comoving derivative 



'~dt at " 



where the LHS is given by Eq. (4.24). 



(4.49) 
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The V • VTi operator couples the flow to the fuel abundance. For a region 
of linear size (5r, the flow time scale is taow = v/Sr. If this is short compared 
to all of the nuclear burning time scales, then v ♦ VY{ « 0, and the flow re- 
gion approaches homogeneity. If we further assume that the flow is turbulent, 
cascading from large eddies down to a dissipation scale (e.g., Landau & Lif- 
shitz 1959), then diffusion at these small scales will remove the composition 
irregularities. This is the limit of complete microscopic mixing. 

For the first thermonuclear stages of stellar evolution, hydrogen and he- 
lium burning, cooling occurs by radiative diffusion. This sets the dissipation 
scale for small eddies in a convection zone. From our previous discussion 
(Sect. 4.1), this implies that the diffusion length for nuclei (f.e., composition) 
is longer than the scale for small eddies, but not enormously so. Complete 
mixing may be the appropriate approximation, at least for non-explosive 
phenomena. 

Later burning stages (C, Ne, O and Si burning) are cooled by i/g^e emis- 
sion. This speeds the burning and decouples the dissipation from the diffusion 
process. Instead of a homogeneous, mixed region, a more realistic picture of 
the convective zone might be an ensemble of independent “blobs” which move 
in and out of the flame, with little microscopic mixing. Most stellar evolu- 
tionary calculations to date still assume homogeneous (microscopic ) mixing. 

Oxygen shell burning is crucial for the late evolution of massive stars. Its 
nature affects, e.g., the size of the core which will collapse (Arnett 1994). In 
the oxygen-burning shell the sound travel time Thyd , the convective turnover 
time Tconvj and the nuclear burning time (see Sect. 4.1) are all of the same 
order as the evolutionary time, and nuclear energy sources and neutrino cool- 
ing are ongoing in convective flows. Meanwhile, it has become possible to 
directly simulate the complex thermonuclear and hydrodynamic behaviour 
of oxygen-burning shells in two space dimensions (Arnett 1994; Bazan & 
Arnett 1994; 1997). These simulations show the nature of convective burn- 
ing being dramatically different from that assumed in ID models. One finds 
mixing beyond convective boundaries determined by mixing-length theory 
(i.e., overshooting), hot spots of burning fuel in the oxygen shell and a con- 
vective velocity structure being dominated by plumes. 

Silicon burning, the last burning stage before core collapse, is even 
more difficult to simulate, because further complications arise due to quasi- 
equilibrium and electron capture (e.g., Arnett 1996). For silicon burning there 
are two classes of time scales: those relating to the slow transformation of Si 
to Fe, and those faster ones relating to the maintenance of quasi-equilibrium 
among the Si to Ca nuclei and free neutrons, protons and alpha particles 
(Clayton 1968). In a typical Si-burning convective zone, these two time scales 
straddle the convective turnover time, so the homogeneous approximation is 
incorrect. As a first step toward correcting for this effect, most evolution- 
ary calculations have assumed local quasi-equilibrium, and transported only 
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quasi-equilibrium global parameters (e.g^.,the Si/Fe ratio, or Si-Ca and Fe- 
peak abundances). 

In astrophysics, the most complicated interplay of nuclear burning and 
hydrodynamics is encountered when trying to simulate turbulent combus- 
tion (see Sect. 4.2.4). In thermonuclear supernovae this combustion processes 
has become a major research activity in astrophysics during the past decade. 
Various groups have performed two and three-dimensional simulations of det- 
onations, deflagrations, and deflagration-to-detonation transitions (Muller & 
Arnett 1982, 1986; Khokhlov 1991; Livne 1993; Arnett & Livne 1994a, b; 
Khokhlov 1994; Bravo & Garcia-Saenz 1995; Khokhlov 1995; Niemeyer & 
Hillebrandt 1995a, b; Khokhlov et ai. 1997; Niemeyer & Woosley 1997; Ruiz- 
Lapuente et ai. 1997 and references therein). Another major activity of multi- 
dimensional turbulent combustion modeling in astrophysics concerns novae 
(e.g., Shankar et ai. 1992; Shanlcar & Arnett 1994; Glasner & Livne 1995; 
Glasner et ai. 1997). 

Simulating these and other multi-dimensional combustion flows one al- 
ways has to grapple with the dilemma, whether one chooses a Lagrangian or 
an Eulerian description of the flow. 

Lagrangian methods avoid artificial mixing of nuclear species and diffu- 
sion of heat, and thus eliminate one of the major errors present in Eulerian 
codes (see also Sect. 2.2.1). However, severe numerical difficulties arise when 
using Lagrangian codes for problems where flows in more than one spatial 
dimension have to be simulated. The formation of vortices in the flow will 
cause large distortions in the Lagrangian grid and perhaps even grid tangling. 
Although this can be alleviated by periodic rezoning, the major advantage 
of the Lagrangian approach is then lost. The sudden mixing of species which 
occurs during the rezoning may provide even worse effects than the gradual 
mixing which occurs during the entire Eulerian calculation. 

Thus, for two or three-dimensional calculations, Eulerian methods are 
probably to be preferred, since the grid remains regular. However, the errors 
associated with the mixing of species must then be considered. The amount 
of mixing can be controlled to some extent by using an adaptive grid which 
moves with the species discontinuities. In addition, an implicit method may 
be required to move the grid correctly, which enormously increases the com- 
plexity of the code, as well as the amount of computer time required. Local 
adaptive mesh refinement can help to keep the size of the mixed region small, 
but can not eliminate the error entirely. If the fluid in the mixed region ex- 
plodes due to the numerical error, the results will still be qualitatively wrong. 
Multi-fluid calculations, which track the location of the interfaces between 
the various fluids, can perhaps solve the problem in simple cases, but for flows 
containing a large number of species, in which every zone may have a dif- 
ferent composition due to burning, such methods seemed to be impractical. 
However, recently a new efficient capturing-tracking hybrid scheme for defla- 
gration discontinuities based on the level-set approach (Markstein 1951) has 
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been proposed, which could make such multi-fluid flow problems tractable 
(Smiljanovski et al, 1997). 

4.4.3 Conservation of Species. When simulating hydrodynamic prob- 
lems involving nuclear burning it is crucial to maintain conservation of the 
partial density of each species, both globally and locally. If one uses a conser- 
vative formulation of the species equations (Eq. (4.46)), global conservation 
is guaranteed. However, changes to some of the difference methods are re- 
quired to ensure that the sum of the partial densities equals the total density 
in each zone (Fryxell et al. 1989). This problem has also been considered by 
Larrouturou (1991), who has proposed respective modifications for numerical 
schemes based on the multi-component approximate Riemann solvers of Roe 
(1981) and Osher (see Abgrall & Montagne 1989). 

To illustrate the problem, let us consider, e.g.,the hydrodynamics code 
PROMETHEUS, which is based on a direct Eulerian implementation of the 
piecewise parabolic method (PPM) of Colella & Woodward (1984) and which 
was developed by Fryxell et al (1989). In PROMETHEUS the values of the 
mass fraction of each species at the zone interfaces are obtained by interpo- 
lation including contact discontinuity detection. The parabolic distribution 
of each mass fraction is then constructed and monotonized. Since the mono- 
tonicity constraint in PPM is nonlinear, f.e.,the constraint operates on each 
abundance in a different way, the sum of the resulting interface values of the 
mass fractions will not, in general, be equal to unity. As a result, the sum 
of the zone averaged mass fractions at the new time will also not be unity. 
This local non-conservation error can have serious effects on the results of 
the calculation, at times even leading to unphysical features in the flow. 

One way to guarantee that the sum of the mass fractions remains unity 
is to omit one of the species equations. The value of the partial density 
for that species would then be determined by subtracting the sum of the 
partial densities of the other species from the total density. This has the 
undesirable consequence of concentrating the total error in one abundance. If 
the errors which occur become large (the error can frequently exceed 100%) 
the value of this last abundance will be meaningless. Fortunately, there is 
a better solution to this problem, as was shown by Fryxell et ai. (1989). 
When the error in the sum of the mass fractions at the zone interface after 
applying monotonicity is in error by more than some specified amount, the 
zone interface values are set equal to the zone average value. This reduces 
the species advection to first-order in that zone and causes more artificial 
mixing of species, but these errors are usually less severe than those obtained 
without the correction. In choosing the maximum amount of error allowed 
before flattening the zone structure, there is a tradeoff between the error in 
conservation and the amount of mixing. The maximum error allowed in the 
calculations of Fryxell et al. (1989) was 10“^. The time averaged value of the 
mass fraction at the zone interface is obtained by advecting the average value 
in the domain of dependence along the streamline. 
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4.4.4 Time Step Restrictions. Making an intelligent choice for the value 
of the time step 6t is one of the most critical aspects in efficiently solving 
problems which couple hydrodynamics and nuclear burning. Using too small 
a value will require the computation of an unnecessarily large number of time 
steps, whereas using too large a value will lead either to unacceptably large 
errors or, if an iterative method is used to solve the reaction network, an 
unconverged solution. 

When a reaction network is incorporated into an explicit hydro-code, 
several restrictions (besides the well-known CFL-condition; see Sect. 2,2.2) 
must be imposed on the time step. First, the temperature and the mass 
fraction of each nuclear species must not be allowed to change by too much 
during the time step. Since the non-iterative solution of the network requires 
linearization of the equations (see Sect. 4.3.2), if too large a change in any 
variable is permitted, the linear approximation will no longer be valid, and 
large errors will result. In addition, if an iterative procedure is used (see 
Sect. 4.3.3), the solution will not converge. These restrictions tstke the form 
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(4.50) 



(4.51) 



where the values of the dimensionless constants ar and ax depend on the 
type of network used for the calculation and how violent the burning is. Typ- 
ically, one allows changes in the temperature of 3% or less, while abundance 
changes of 5% to 10% can usually be tolerated. When an iterative method of 
solution is used for the network, another time step restriction is achieved by 
requiring that the solution converges in a small number of iterations. This 
restriction can be written as 



= St^ Min 




(4.52) 



where I is the number of iterations required for convergence on the previous 
solution of the network. This restriction has the effect of reducing St whenever 
the number of iterations required exceeds 3. 

The value of St is chosen to be the minimum obtained from each of the 
above equations in the most restrictive zone. There will be times when even 
this value of St will be too large for the network to converge. When this 
happens, there is an additional procedure which one can use to continue 
the calculation. The values of temperature, energy, and mass fraction are 
stored before the network is solved. If the network does not converge, the 
value of St can be reduced and the network recomputed starting from the 
original values. Since a smaller value of St was used for the network than 
for the hydrodynamics, the network must be solved several times until the 
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variables have been advanced to the correct time level. Equivalently, one could 
recalculate the entire hydrodynamics and burning step with the same value 
of St^ but this would require storing old values of all of the hydrodynamic 
variables. 

4.4.5 A Computational Aspect. As we have seen above solving a re- 
action network in an (ID) stellar evolution code requires the solution of a 
nonlinear system of equations independently for each grid point, if no mixing 
occurs. This parallelism can be utilized on vector and parallel processors. 
The reaction rates, the elements of the Jacobian, and the solution of the lin- 
ear systems arising during each iteration step should be calculated for all, or 
at least a large number, of grid points in parallel. 

Whereas the vectorization or parallelization of the linear systems poses 
a more challenging problem, the calculation of the reaction rates and of the 
elements of the Jacobian can be easily vectorized or parallelized by straight- 
forward programming techniques. It should be noted here that this task is 
extremely simplified if all reaction rates used in the network are given by a 
common fit-formula with rate-dependent coefficients. 

According to the idea of Gustavson et aJ. (1970), the sparse linear sys- 
tem arising within the Newton iteration of the network equations can be 
solved very efficiently. A computer program which uses symbolic processing 
generates another program which contains the FORTRAN statements for 
the optimal Gaussian elimination routine in the sense that it explicitly takes 
into account the sparsity pattern of the matrix by omitting all operations 
involving zeroes. It should be stressed here, however, that this approach as- 
sumes that no pivoting is required, which indeed is the case for the network 
equations. 

Besides eliminating unnecessary operations, this approach allows one to 
vectorize or parallelize the problem, because at all grid points the matrices 
have the same structure, differing only by the values of the elements. If the 
matrix elements for a whole set of grid points are stored in a three-dimensional 
array, where the last two indices give the row and column index of the element 
and the first index specifies the grid point, the sparse Gaussian elimination 
routine (and the program generator) can be designed in such a way that the 
innermost loop runs over the number of grid points, thereby fully exploiting 
the parallelism of the problem. Such a program generator, called GIFT, is 
available from the author upon request. 



4.5 Some Instructive Numerical Experiments 

Fryxell et al. (1989) have studied the coupled processes of hydrodynamics ajid 
nuclear burning in a general geometry. They critically examined some of the 
numerical tools available to the theorist performing an extensive set of test 
calculations of one-dimensional shock tubes with and without burning. In 
their study, Fryxell et ah (1989) considered several different Eulerian schemes 
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including a second-order flux-vector splitting scheme (SADIE; Arnold 1985) 
and the PROMETHEUS code based on the PPM scheme of Colella & Wood- 
ward (1985). They further tested a Lagrangian PPM scheme so that the 
effects of artificial mixing could Ccisily be seen. Although all tests problems 
considered by Fryxell et ai. (1989) were one-dimensional, they clearly showed 
the types of errors one will encounter in performing multi-dimensional calcu- 
lations. Some of their results will be discussed in this section (see also Muller 
1994). 

The equation of state used in the calculations of Fryxell et ai. (1989) was 
chosen to provide the correct qualitative behavior for a realistic astrophysical 
gas, but yet be simple enough to be used for easily reproducible test calcula- 
tions. Three contributions to the pressure are included - ideal gas, radiation, 
and a term which simulates a degenerate electron gas as a simple gamma-law. 
The effects of radiation pressure will strongly limit the temperature which 
the combustion products reach, and the effects of electron degeneracy can 
determine whether the fuel will burn explosively, producing a detonation or 
deflagration, or quietly, such as in the center of the Sun. 

All codes considered by Fryxell et aL (1989) were written in strict conser- 
vation form (see Sect. 3 of the lecture by LeVeque). In most circumstances, 
this is the best approach, since it guarantees that shock jumps will be cor- 
rect. However, using a conservative energy equation can lead to large errors 
if the kinetic energy is much larger than the thermal energy. In this case, the 
resulting pressure and temperature may be very inaccurate. When nuclear 
burning is included in the calculation, such large temperature errors could be 
disastrous. Using a nonconservative equation for only the internal energy is 
probably better under these circumstances. However, when such a situation 
does not occur, conservative methods should provide more accurate results. 

Two separate reaction networks were considered by Fryxell et aL (1989). 
The first was a very simple network consisting of two species, and ^^Ni, 
and a single exothermic reaction. In reality, carbon does not burn directly to 
nickel. However, the approximation was made for this network that after the 
12 c( 12 c^ a)^^Ne reaction is completed, the reaction products burn instantly 
to ^^Ni. Thus they used the rate for the initial reaction only but took the 
energy release obtained from burning the carbon completely to nickel. 

In some of their test problems they used a second more complex reaction 
network, which contained the 13 species of the complete a-chain up to ^®Ni. 
These nuclei (^He, ^^0, '^^Ne, ^^Mg, ^^Si, 225 ^ 36 40^^^ 44 ^ 1 ^ 48^^, 

^^Fe and ^^Ni) are linked by 27 reactions, which include the eleven (q, 7 ) reac- 
tions from ^^C(a, 7)^®0 to ^^Fe(a, 7 )^®Ni, the corresponding 11 endothermic 
photo-disintegration reactions, the three heavy-ion reactions ^^C(^^C,o:)^^Ne, 
i2c(i6o,o;)24Mgj and ^® 0 (^® 0 ,a)^^Si, and the triple-o reaction and its in- 
verse. 

4.5.1 Detonations Caused by Numerical Errors. In one set of test 
calculations Fryxell et aL (1989) showed that due to numerical errors artificial 
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detonations can be obtained. The initial conditions for this set of degenerate 
shock tube calculations with burning were given by 

ql = 2 X lO^gcm"^ Qr = 2x lO^gcm'^ 

ul =0 ur = 0 

Tt^SxlO^K Tr=^ 5 xlO^K 

The composition of the material was ^®Ni on the left side of the grid and 
on the right. The simple two species nuclear reaction network for burning 
carbon to nickel was included, and the radiation pressure term was discarded 
in the equation of state (see Sect. 4.5.1). The calculations were performed on 
a grid of length 0.1 cm with the initial jump located at 0.05 cm. 

The to the right of the contact discontinuity is initially at too low a 
temperature to burn, and the shock is far too weak to raise the temperature 
to the ignition point. There is no fuel to the left of the contact discontinuity 
where the temperature is high, so there is no burning in that region either. In 
fact, if the contact discontinuity remained perfectly sharp, there would be no 
energy generation from burning during the entire calculation. However, since 
the discontinuity is spread slightly by Eulerian difference schemes, a small 
amount of the fuel is diffused into the region where the temperature is high, 
and some of the heat is conducted from the hot region into the fuel. Thus 
there is a narrow region where nuclear burning occurs. The amount of diffu- 
sion is, however, small enough that the energy generation is not dynamically 
important, and the results are qualitatively correct. The flow never reaches a 
configuration where the temperature and the abundance of are large in 
the same zone. In fact, at the end of the calculation, not even one zone has 
burned completely. 

When using a flux-vector diflFerencing scheme (i.e., SADIE) instead of 
PPM (i.e., PROMETHEUS) a qualitatively different result was obtained. 
Unlike the other schemes in the comparison, SADIE has the property of 
rapidly spreading slowly moving (and even stationary) contact discontinu- 
ities. Thus the amount of mixing between the cold fuel and the hot ashes 
across the contact is much larger than for the other codes. Under these con- 
ditions, the amount of mixing was sufficient to cause the fuel to detonate. The 
temperature of the fuel is raised to more than 4 x 10^^ K producing an en- 
ergy generation rate of 10^^ergg“^ s“U The resulting detonation appears 
physically correct, traveling at the correct speed and producing the correct 
post-detonation conditions. However, the origin of the detonation was due 
completely to a numerical error. 

To complicate matters further Pryxell et al. (1989) performed a calculation 
using Eulerian PPM with slightly modified initial conditions. The fluid in the 
entire grid was given an initial velocity ul — ur ~ 10^ cms~^. The results 
should be exactly the same as in the previous experiment except shifted to the 
right by a distance uit. However, the results looked almost identical to those 
obtained with SADIE. The only reason SADIE gives different results from the 
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other codes was that the contact discontinuity was nearly stationary. When 
the discontinuity is moved across the grid, every Euler ian code will spread 
it to some extent. Even with PPM, the code which spreads discontinuities 
the least of those tested, the spreading was sufficient to cause the to 
detonate, producing a qualitatively incorrect solution. This shows the amount 
of caution which is needed in calculating reactive flows with an Eulerian 
hydrodynamic code. 

4.5.2 Detonations with Single Exothermic Reaction. Another set of 
test problems Fryxell et ai. (1989) considered was one containing a detonation, 
which, unlike the previous example, is not of numerical origin. The fluid to the 
left of the interface was given a finite velocity which produces a sufficiently 
strong shock to ignite the fuel to the right of the interface. The equation 
of state for this example included all three terms (ideal gas, radiation, and 
simplified electron degeneracy). Nuclear energy generation was provided by 
the simple reaction network containing only and ^®Ni. This is the simplest 
possible situation to analyze since there is only a single exothermic reaction 
to deal with. The initial conditions used for this set of calculations were 
ql = 1-25 X lO^gcrn"^ — 1 x lO^gcm”^ 

ul — 5 X 10® cm s"^ Ur = 0 

Tl = 2x 10^^ K Th = 8 X 10*^ K 

The initial composition of the material to the left of the jump was pure ®®Ni 
with pure to the right. The calculations were performed on grids of length 
2, 4, 8 and 16 x 10”^ cm, with the initial jump always being in the middle of 
the grid. 

In this set of calculations, the number of zones in the grid was fixed at 200, 
while the length of the grid was increased. This provides an indication of how 
coarse a grid can be used while still obtaining acceptable results. In doubling 
the grid length from 2 x 10~^ cm to 4 x 10“^ cm, the only noticeable difference 
in the results is in the resolution of the peak behind the shock. The speed 
of the front and post-detonation state remain unchanged. However, when 
the grid length is doubled again, the results change qualitatively. The shock 
now propagates to the right with a velocity considerably higher than the 
Chapman-Jouguet velocity, and an additional plateau appears in the profile. 
The values at the second plateau are still the same as the post-detonation 
values obtained in the more finely zoned calculations, as they must be in 
a conservative scheme. However, because of the additional unphysical wave 
which forms, the results are totally unacceptable. If the grid size is doubled 
again, the situation becomes worse. The speed of the shock front increases 
further and the discrepancy between the post-shock values of the variables 
and the correct post-detonation values is even larger. Similar behavior was 
noticed by Colella et ai. (1986). 

This example clearly shows the fallacy of the argument that, since the 
jump conditions in a detonation, as in a shock, are based only on conser- 
vation laws, detonations can be calculated on any mesh with a conservative 
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difference scheme. In ideal hydrodynamics, the shock front is infinitely thin 
(see Sect. 4.2.2; Fig. 4.1). Thus, the fuel spends no time within the shock and 
does not start to burn until it reaches the post-shock state. The problem 
arises when the time which the fuel spends within the shock front, which is 
given by the width of the shock produced by the difference scheme divided by 
the velocity of the shock, becomes comparable to or longer than the burning 
time of the fuel. When this happens, nearly all of the fuel is burned within 
the shock front. Thus, when the numerical errors become large enough that 
the physics of the problem is violated, it is not surprising that an unphysical 
solution is obtained. If the energy production within the lower portion of the 
shock front is large enough, the pressure in these zones will be raised suffi- 
ciently to create a bulge in the shock profile. This bulge then propagates to 
the right ahead of the real front, producing the extra wave. 

The grid size at which each code in the test made the transition to the 
unphysical solution was different and depended on the shock width and 
amount of mixing of the fuel and ashes. 

SADIE and Eulerian PPM worked properly until the grid length reached 
8 X cm. The Lagrangian PPM method produced unacceptable results 
only when a grid size of 16 x 10”^ cm or larger was used. 

It was possible, however, to obtain acceptable results on large grids by 
making a small modification to the codes. The incorrect solution was ob- 
tained because the burning was occurring within the shock instead of behind 
it (see Sect. 4.2.2; Fig. 4.2). This error can be avoided by not allowing the fuel 
to burn in any zone which contains a shock. Such zones are easily detected 
in a hydrodynamic code. Any zone which contains a sufficiently large pres- 
sure gradient and a negative velocity divergence (to ensure that the zone is 
compressing) can be considered to be within a shock. The detonation wave 
obtained using this algorithm with the Eulerian PPM code moved at the 
correct velocity and had the proper post-detonation state for grid lengths, at 
which the unmodified algorithm failed before. 

4.5.3 Detonation with a-Network. Fryxell et al. (1989) simulated an- 
other shock tube containing a detonation. However, the energy generation was 
now produced by the 13 species, 27 reactions a-network described above. This 
calculation provides an example of the type of results which can be obtained 
when using a moderately complex network with an Eulerian hydrodynamic 
code. 

For a detonation with an a-network it is impossible to obtain a solution 
which is even close to being converged using the techniques examined in 
the survey of Fryxell et ai. (1989). Since the burning times for the various 
reactions differ by several orders of magnitude, there is no way to resolve the 
time and length scales for each of the burning processes. In this sense, the 
detonation wave is no longer thin. If one chooses to resolve the structure of 
the front, as in the previous example, then the calculation cannot proceed 
long enough to follow the completion of the burning. If, on the other hand. 
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the calculation is performed to see what the final products of the burning will 
be, the structure of the front cannot be resolved. This also makes calculating 
the correct qualitative behavior easier than with the simple network. Since 
the front is wider for the a-network, qualitatively correct detonations can be 
computed on grids large enough that meaningful calculations of stellar size 
problems can be performed. 

The initial conditions for the detonation with an a-network computed by 
Fryxell et al. (1989) were 

ql = 2.5 X 10® g cm“^ qr - 1 x 10^ g cm“^ 

— 5 X 10® cms“^ Ur — 0 

Tl = 8 X 10® K Tr = SxlO'^K 

The initial composition of the material to the left of the jump was pure ^®Ni 

with pure to the right. All other mass fractions were set to an initial 
value of 10“^^ in each zone, and all three terms in the equation of state were 
included. Results were obtained on grids of two different lengths, 0.01cm 
(jump at a: = 0.005 cm) and 1 cm (jump at a: — 0.5 cm), so that the structure 
of the front could be seen on two different time scales (^ = 6 x 10~^^s and 
i = 6 X 10“ respectively). 

The structure of the detonation obtained using Eulerian PPM on a 1 cm 
grid of 800 zones at i = 6 x 10“^® s is characterized by a small poorly resolved 
peak immediately behind the shock front. Behind the peak is a plateau which 
is not quite flat, since there is still nuclear burning occurring in this region. At 
the left of the grid is a second plateau. The density and pressure in this region 
have been reduced slightly by a rarefaction wave which has propagated to the 
left from the initial discontinuity. The two plateaus are joined by a contact 
discontinuity. This discontinuity is uncharacteristically wide for PPM because 
of the nonlinear interaction of the diffusion of species across the discontinuity 
with the burning processes. The enhanced rate of species diffusion is a result 
of the flattening of the zone structure required to maintain local conservation 
of the abundances. The structure of the detonation obtained on the 10“^ cm 
grid is qualitatively similar. 

The time evolution of the abundances shows three distinct regions in the 
post-detonation flow. The region between the shock and the contact discon- 
tinuity contains the early time products of carbon burning. To the left of the 
contact discontinuity is a region in which the initial composition of almost 
pure ^^Ni is beginning to photo-disintegrate to form an equilibrium distri- 
bution of abundances. At the contact discontinuity, there is a third narrow 
region which contains a mixture of species from the other two regions. This 
mixing is unphysically large and results completely from the diffusion present 
in the Eulerian hydrodynamic method. 

The evolution of the abundances proceeds as follows. Since the mate- 
rial ahead of the shock is pure the only reaction in the network which 
can operate is ^^C(^^C,'^He)^®Ne. Thus the dominant abundances immedi- 
ately behind the shock should be '^He and ^®Ne. This composition layer is 
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just barely detectable on the smaller grid. Further behind the shock, ^^Ne 
photo-disintegrates into ^^0 and ^He. With the formation of a significant 
abundance of the three heavy-ion reactions become dominant, produc- 
ing ^^Ne, ^^Mg, and ^^Si. The abundance of which is produced by the 
^®Si(a,7)^^S reaction, is also becoming significant near the contact disconti- 
nuity on the smaller grid. The remaining (a, 7) reactions are just beginning at 
this time {t « 10“^^ s), producing trace abundances of Ar, ^®Ca, ^^Ti, and 
'^^Cr. At a somewhat later time (t = 6 x 10“^^ s) and are depleted 
and the products of the (0,7) reactions are increased. The mass fraction of 
the heaviest element in the network, ^^Ni, has increased to about lO""* near 
the contact discontinuity at this time. 

The abundances obtained on a coarser grid of 200 zones are qualitatively 
similar to those discussed above. The primary difference is that the region 
where mixing has taken place is much wider. Fortunately, the mixing in this 
case does not seem to have a significant effect on the qualitative results of 
the calculation. Although the burning was far from being complete at the 
end of the test calculation of Fryxell et ai. (1989), the propagation velocity of 
the detonation front was approximately correct, and the variables behind the 
detonation appeared to be approaching the correct values. However, Fryxell 
et ai. (1989) pointed out that in similar calculations performed with more 
diffusive schemes the mixing of species will dominate the abundance profiles 
over more than half the region between the shock and contact discontinuity, 
making it difficult, if not impossible, to follow the abundance changes caused 
by the various burning processes in this part of the flow. 

The result obtained by Fryxell et al. (1989) using the Lagrangian PPM 
code were much cleaner, since there was no mixing region in the center to 
corrupt the abundance profiles. The boundary between the two burning re- 
gions was very sharply defined. Even so, except for this central region, the 
results obtained by the Eulerian code agreed very well with the Lagrangian 
results. Thus, it appears that even with a network containing many species, 
a good Eulerian code can produce reasonable results for many problems. For 
more diffusive Eulerian methods the mixing in the central region could be 
so severe that it would corrupt the abundance profiles over the entire grid, 
making it virtually impossible to observe the abundance changes due to the 
various burning processes. 



5. Simulation of Astrophysical Jets 

Jets, i.e., highly collimated supersonic outflows, are a common astrophysical 
phenomenon. They are found associated with young stellar objects (YSOs; 
Mundt 1988; Eisloffel 1996), planetary nebulae (Mellema 1996), short-period 
cataclysmic variables (Shabaz et aJ. 1997), compact stars (Mirabel & Ro- 
driguez 1994; Vermeulen 1996; Bell 1997) and active galactic nuclei (AGNs; 
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e.g., Bridle & Perley 1984; Muxlow & Carrington 1991). Despite their ubiq- 
uitity and decades of intense observational and theoretical efforts, the for- 
mation, collimation and acceleration of jets is still not understood (see the 
articles in the books edited by Hughes (1991), Bugarella et ai. (1993), Hardee 
et ai. (1996) and Kundt (1996). 

Many of the ideas and models, which have been proposed to explain the 
formation of astrophysical jets, involve accretion onto a central object, such as 
a young star surrounded by the remainder of the molecular cloud in which it 
formed, or a rotating super massive black hole in an active galactic nucleus be- 
ing fed by interstellar gas and gas from tidally disrupted stars (e.g., Begelman 
et ai. 1984). However, accretion seems to be no necessary prerequisite for the 
formation of jets, because there exists observational evidence that isolated 
pulsars and neutron stars can produce jets (Bell 1997). Concerning the (ini- 
tial) collimation and acceleration of astrophysical jets most models involve 
MHD processes at or near the inner edge of a rotating magnetized accretion 
disk with corona. The combined action of thermal and magnetic pressure 
gradients, of centrifugal force and of gravity drives an outflow, which does 
collimate itself (Blandford & Payne 1982). General relativistic effects seem to 
be crucial for a successful launch of the jet (for a review see, e.g., Camenzind 
1997). 

Numerical studies of extragalactic jets can be divided into several dis- 
tinct categories (e.g., Lovelace & Romanova 1996). Ordered with increasing 
distance to the source these categories are the formation and collimation of 
jets at sub- parsec-scales, and the propagation of jets at parsec-scales and 
at kiloparsec-scales, respectively. Up to now most simulations have dealt 
with the propagation of jets in the framework of Newtonian hydrodynamics. 
A Newtonian treatment is justified, however, only at distances larger than 
several kiloparsec. At smaller scales, and in particular at parsec-scales, jets 
propagate at relativistic speeds (see Sect. 5.1). Hence, special relativistic hy- 
drodynamic simulations must be used in order to describe their propagation 
correctly (see Sect. 5.4). Simulating the formation and collimation of jets re- 
quires both more complicated physics and numerics, because hydromagnet ic 
and general relativistic effects must not be neglected. Up to now, only a 
few such simulations have been performed (e.g., Shibata & Uchida 1986; Bell 
1996; Stone & Norman 1994; Koldoba et ai. 1995; Meier et ai. 1996; Ouyed et 
ai. 1997; Romanova et ai. 1997). 

As a general discussion of astrophysical jets is far beyond the scope of 
my lecture, I will focus on hydrodynamic simulations of propagating (non- 
magnetized) extragalactic jets, i.e., of jets entering the computational domain 
with supersonic speed. Firstly, however, I will briefly review some observa- 
tional findings concerning extragalactic jets. 
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5.1 Observations of Extragedactic Jets 

In 1918 H.D. Curtis photographed the giant elliptical galaxy M87 and no- 
ticed a “curious straight ray”, which was “apparently connected with the 
nucleus of the galaxy by a thin line of matter” , This was the first discovery 
of a so-called extragalactic jet. Nowadays several hundreds of these extra- 
galactic jets are known, which are detectable by the synchrotron and inverse 
Compton radiation they emit at radio frequencies. Very high resolution radio 
observations (VLB I imaging) provide direct evidence that the jets transport 
enormous amounts of energy from the nuclei of radio galaxies and quasars 
to intergalactic space thereby powering the observed extended regions of ra- 
dio emission (called radio lobes; Fig. 5,1) of these sources (Blandford & Rees 
1974; Scheuer 1974; Bridle & Perley 1984). 




Fig. 5.1. AGN scenario: In the active galactic nucleus (AGN) of a galaxy energy 
is produced by accretion of interstellar gas tind of gas from tidally disrupted stars 
onto a rotating super massive (^ lO^M©) central black hole. The liberated grav- 
itational binding energy is converted by some mechcinism, probably involving an 
accretion disk and magnetic fields, into a highly collimated bipolar outflow (twin 
jets), which transports the energy up to several 100 kpc out of the AGN, thereby 
feeding extended regions (lobes) of radio emission 



High resolution radio interferometry has shown that the jets are extremely 
well collimated with opening angles of a few degrees only, extending from the 
very center of the galaxy (i.e., distances of less than a parsec) all the way out 
to distances of up to several hundreds of kiloparsecs (Bridle & Perley 1984; 
Muxlow & Carrington 1991). Whereas motion is observed on parsec-scales, 
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except for M87 (e.g., Biretta 1996), none of the sources in which jets have 
been detected shows any direct evidence for motion on kiloparsec-scales. Nev- 
ertheless, the observed continuity of the jets from parsec to kiloparsec-scales, 
supports the commonly accepted view that jets are collimated, continuous 
outflows. Whether the jet “fluid” consists of a baryonic plasma (i.e., protons 
and electrons), an electron-positron pair plasma, a mixture of both or pre- 
dominantly of electromagnetic Poynting flux is still debated controversially 
(e.g., the books edited by Harry et a!. 1996 and Kundt 1996). 

At kpc scales, the morphology of the observed jets varies from weak 
sources (Fanaroff-Riley Class I sources; Fanaroff & Riley 1974) to power- 
ful sources (Fanaroff-Riley Class II radio galaxies and lobe-dominated radio- 
loud quasars). The most remarkable difference is that FRI sources tend to 
have (at kpc scales) prominent smooth continuous two-sided jets running into 
the lobe structures, whereas powerful sources have usually one-sided (with 
a jet/counterjet flux density ratio > 4:1), knotty jets with bright outer hot 
spots (e.g.jMuxlow & Carrington 1991). In core dominated sources, where 
the luminous core is usually associated with a bright one-sided radio jet, in- 
terferometric imaging of very long baseline (VLBI) has revealed a continuity 
between small and large-scale structures with inner pc scale one-sided jets 
extending to outer kpc scale jets. 

Features in the bright one-sided jets of FRII sources are observed to 
be (apparently) moving away from the core at superluminal speeds in more 
than 40 sources (Ghisellini et aL 1993) to distances beyond 100 pc (as, e.g.,in 
the quasar 3C273; Davis et al. 1991) indicating relativistic flow. Additional, 
independent observational support of highly relativistic speeds can be derived 
from the intra-day radio variability occurring in more than a quarter of all 
compact extragalactic radio sources (Krichbaum et ai. 1992 ). According to 
Begelman et ai. 1994, if the observed intra-day radio variability is intrinsic 
and results from incoherent synchrotron radiation, the associated jets must 
have bulk Lorentz factors in the range ~ 30 - 100. Relativistic outflow with 
apparent superluminal motion has also been observed in the Galaxy, namely 
in the two galactic “micro-quasars” GRS 1915+105 (Mirabel & Rodriguez 
1994) and GRO J1655-40 (Tingay et ai. 1995). These galactic X-ray sources 
consist of a binary, where one member of the binary is most likely a black 
hole. 

Within the nowadays accepted model of compact radio sources (e.g., Bla- 
ndford et ai. 1977) the apparent superluminal motion observed in these 
sources, as well as their one-sidedness, are explained by assuming that one of 
the twin jets propagates with relativistic speed at a small angle to the line of 
sight towards the observer (Fig. 5.2). In that case the apparent time duration 
(between two events) seen by a distant observer is compressed, because the 
source “chases behind” its own radiation. Hence, the apparent velocity of 
any transverse motion can appear to exceed the speed of light. In particu- 
lar, this holds for the radio condensations (“radio blobs”) ejected from the 
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Fig. 5.2. When a source of radiation propagates 
with relativistic speed v = /3 c at a small angle 
O to the line of sight towards the observer, its 
transverse velocity vt can apparently exceed the 
speed of light c when measured by the observer. 
This purely kinematic effect is a consequence of 
the relativistic transverse Doppler effect 



source with apparent superluminal speeds. This purely kinematic effect is a 
consequence of the relativistic transverse Doppler effect. A distant observer 
measures an apparent transverse velocity Vohs = Pobs c for any feature mov- 
ing with a space velocity i; = ^ c at an angle 0 to the line of sight towards 
the observer, which is given by (Blandford & Konigl 1979) 



Pobs — 



PsinO 
I — P cos 0 



(5.1) 



This relation implies that the apparent transverse velocity has a maximum 
for cos 6> = /3 with 



/JoTs* = WP , 



where 



W = 



1 



(5.2) 

(5.3) 



is the Lorentz factor of the feature. According to Eq. (5.2) P^^ > 1 for 
P > l/\/5. Hence, the apparent transverse velocity can become superluminal 
for large jet velocities (i.e.,/3 close to 1) and sufficiently small angles 0. 

The one-sidedness of jets observed in FRII sources can be explained by 
relativistic motion at a small angle to the line of sight, too. Due to relativistic 
aberration radiation emitted at some angle to the flow direction in the source 
frame, is seen at a smaller angle in the observer’s frame. In addition, the 
emission from the jet propagating towards the observer is blue shifted while 
that of the counterjet is red shifted. Both effects give rise to the so-called 
Doppler beaming of the emission in the direction of motion. The resulting 
asymmetry in the luminosity of the twin jets can be quantified through a 
Doppler factor given by (Blandford & Konigl 1979) 
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V = W~^ (1 - /3cos0)”^ . (5,4) 

For an optically thin source, the observed surface-brightness Hobs is related 
to the surface-brightness S that would be measured by a comoving observer 
by 

TobsH ocr(j/)p2+« , (5.5) 



where a is the spectral index of the emitted radiation. Thus, the relative 
Doppler boosting of jet and counter-jet is proportional to 



1 + /3 cos 0 



l2+a 



1 - /?COS0J 



(5.6) 



For /3 — 0.99 (f.e., a Lorentz factor W ^1) and 0 < 5 degrees the fraction in 
Eq. (5.6) is larger than 144 (and less than 199). The resulting jet-counter-jet 
surface-brightness ratio is larger than 2 x 10^, since 0.5 ^ a ^ 1. 

Doppler beaming also provides an explanation for the Laing-Garrington 
correlation. According to Laing, Carrington and colleagues (Carrington et 
aJ. 1988; Laing 1988) the near side lobe of a powerful radio source (i.e., FRII) 
containing a jet is almost always less depolarized than its counterpart on 
the far side of the nucleus. This asymmetry can be explained by assuming 
that the jets are intrinsically two-sided and relativistic. Hence, the nearer 
jet appears brighter and radiation from its lobe passes through less of the 
depolarizing medium around the source. 

Whether the relativistic flows inferred from radio jets at sub-parsec and 
parsec-scales extend to kiloparsec distances is still an unsolved issue. The 
common belief is, however, that jets in FRII radio galaxies and quasars 
which show significant flux eisymmetries between jet and counterjet are at 
least mildly relativistic on large scales. In fact, bulk flow velocities of the 
order of 0.7 c have been found for several sources (1928+738: Hummel et 
aJ. 1992; 3C179: Akujor 1992; 1055+201, 1830+285, 2209+080: Hooimeyer et 
ai. 1992). The observed asymmetries indicate that relativistic motion extends 
out to kpc scales, although with smaller values of the overall bulk speeds 
(Bridle et ai. 1994). This slow-down of the jets makes it necessary to look for 
adequate models of flow deceleration between parsec and kiloparsec-scales. 

In the case of FRI sources, direct measurements of the proper motion of 
knots in the M87 jet performed with the VLA (Biretta & Owen 1990; Biretta 
et ai. 1995) have proven the existence of apparent superluminal motions out to 
a distance of about one kiloparsec. According to these authors, the measure- 
ments are compatible with a kinematic model for the jet in M87 consisting of 
an inner jet (from the nucleus to knot A located 3it ^ 1 kpc) containing a high 
Mach number flow with a Lorentz factor ^ 3 and a slower outer jet (Lorentz 
factor ^ 2) beyond knot A. Such a relativistic inner jet could be responsible 
for the one-sidedness of the jet base seen in many FRI sources (Muxlow & 
Carrington 1991). Recent observations support the idea that the jets of all 
FR I radio sources are initially relativistic (on pc-scales) and are slowed down 
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to sub-relativistic speeds at distances of ~ 100 pc (e.g., Komissarov 1990a, b; 
Bicknell 1994, 1995; Urry & Padovani 1995; Laing 1996). Applying such a 
decelerating relativistic jet model to the archetypical FRI source 3C31 leads 
to an almost constant beam deceleration from 0.8 c at distances less than two 
kpc from the nucleus, to 0.1c at distances 7kpc (Laing 1996). Further- 
more, data from HST optical and MERLIN radio observations of the quasar 
3C264 are consistent with a model where an initially highly relativistic jet 
{v ~ 0.98 c) decelerates dramatically to a velocity 0.4 c at a projected dis- 
tance of ~ 300 pc (Baum et ai. 1997), which coincides with the outer boundary 
of the galactic disk. 

5.2 Newtonian Hydrodynamic Simulations of Extragalactic Jets 

Although the collisional mean free path is very large in the jet plasma, a 
hydrodynamic description of the jet flow is justified, because of the presence 
of micro-Gauss magnetic fields which provide the collisional coupling of the 
plasma. In extragalactic jets the Larmor radii and Debye lengths of positrons 
(or protons) and electrons are several orders of magnitude smaller than the 
jet widths (Begelman et aJ. 1984). For this reason, hydrodynamic simulations 
have become an important tool in our understanding of the morphology and 
dynamics of extragalactic jets ever since Norman et aJ. (1982) verified the 
jet model of Blandford & Rees (1974) in their pioneering investigation (for a 
review see, e.g., Norman 1993, 1996a). 

5.2.1 Numerical Set-up and Parameter Space. Simulating the propa- 
gation of non-relativistic or Newtonian jets consists of solving the following 
initial-boundary value problem: A continuous collimated beam of gas density 
velocity Ub and pressure Ph is injected through an aperture of radius i?b 
into a computational domain filled with an ambient medium gas of density 
Pm and pressure Pm moving with a velocity Um- In the simplest numeri- 
cal setup one assumes that the beam is introduced with zero opening angle 
(i.e., perfect collimation), the ambient medium is homogeneous, and that both 
the jet material and the ambient medium are an ideal gas with a specific heat 
ratio 7 = const. Accordingly, the initial-boundary value problem, and hence 
the flow, is specified by six quantities Qh^Vb^Pbi Qm^Vm^Pm and the choice 
of a length or time scale, or a reference frame. 

In astrophysical jet simulations, the hydrodynamic equations are com- 
monly solved in normalized form, i.e., the three conservation equations for 
mass, momentum and energy are scaled to make them dimensionless (e.g., Nor- 
man et ai. 1982). The units of length, velocity and density are chosen to be 
the jet (beam) radius Rb, the sound velocity Cm and the density Pm of the 
ambient medium. Then, for example, the unit of time is to ~ Pb/cm« Fix- 
ing the frame of reference by assuming the ambient medium to be at rest 
(i.e.jUm = 0) the flow is completely characterized by three dimensionless 
parameters: the density ratio 
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'n 



= ^ 






(5.7) 



and the pressure ratio 

K = ^ (5.8) 

of the incoming beam gas to the ambient medium, and the Mach number of 
the jet 



Mab = — 
Cb 




(5.9) 



where Cb is the sound velocity in the beam gas. Note that the Mach number 
is chosen as the third parameter, because Um = 0. In most simulations of 
astrophysical jets the parameter K is assumed to be unity, i.e., only pressure- 
matched jets are considered. 



5.2.2 Simulations. The majority of jet simulations has been (and is still 
being) performed in two spatial dimensions. The computational domain in 
these simulations is either of rectangular (Cartesian geometry) or cylindrical 
shape (axial symmetry). The latter geometry is a natural one for study- 
ing the propagation and morphology of axially symmetric jets. When 3D 
simulations cannot be afforded to investigate the behaviour of jets against 
non-axisymmetric instabilities, 2D simulations in Cartesian geometry are an 
alternative using planar or slab jets. The size of the computational domain 
in jet simulations typically encompasses ~ lOO/^b in the direction of the jet 
propagation and ~ 10 — 20i?b in the perpendicular direction. The spatial 
resolution varies from 5 — 10 zones per beam radius in simulations with poor 
resolution to ~ 100 zones per beam radius in the largest 2D simulations. The 
first three-dimensional simulations (in Cartesian coordinates) used relatively 
coarse computational grids of ~ 10^ zones (Williams & Gull 1984) and 10® 
zones (Arnold 1985; Arnold & Arnett 1986), respectively. Meanwhile 3D sim- 
ulations have been performed with ~ 10^ zones (Norman 1996b; Muller & 
Brinkmann 1997). 

It has been found that 10 zones per beam radius are adequate to capture 
the essentials of the jet flow with a modern higher order-accurate shock- 
capturing finite volume scheme such as MUSCL, PPM or TVD (e.g-., Norman 
1996a). Twenty zones are required to resolve additional internal structure in 
the jet and to correctly describe the dominant modes of the Kelvin-Helmholtz 
instability responsible for bending and filamenting of the jet (Hardee & Clarke 
1992). In order to begin to get an accurate estimate of the mass entrainment 
caused by the onset of turbulence in an extragalactic jet a resolution of at 
least 40 zones per beam radius is necessary (Boris et al. 1992; see also Token 
et aJ. 1996). 

Since the pioneering work of Norman et al, (1982) hydrodynamic simula- 
tions have been used successfully to investigate the morphology and dynamics 
of propagating Newtonian jets (see next subsection). Magnetohydrodynamic 
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simulations have shown the importance of toroidal magnetic fields for the 
confinement of jets (e.g., Clarke et aJ. 1986, 1989; Lind et aJ. 1989; Kossl et 
aJ. 1990a, b), and for the appearance of genuine new MHD features, like 
e.g., the formation of a “nose cone” (Clarke et aJ. 1989). K magnetic fields are 
dynamically important, they inhibit the formation of large cocoons, which 
suggests that magnetic fields in the lobes of classical double radio sources are 
probably dynamically unimportant (Clarke et al. 1989). 

On kiloparsec-scales, the assumptions of axisymmetry and of a homoge- 
neous environment become doubtful. Consequently, 2D (planar) and 3D hy- 
drodynamic as well as MHD jet simulations have been performed to study 
the stability of jets (e.g., Hardee & Clarke 1992; Bodo et al. 1994; Hardee & 
Clarke 1995; Hardee et ai. 1995; Hardee 1996). Recently these simulations 
have been extended to consider the growth of the Kelvin-Helmholtz insta- 
bility in (optically thin) radiatively cooling jets (Stone et aJ. 1997). Models 
for narrow-angle-tail sources whose complex morphologies are the result of 
the interaction of a supersonic jet with a cross wind (Begelman et al, 1979) 
have been tested by means of 3D simulations (e.g., Williams & Gull 1984; 
Balsara & Norman 1992). The propagation of a jet into an oblique magnetic 
field (Koide et aJ. 1996), across an oblique interstellar medium/intra-cluster 
medium interface (e.g.,Hooda & Wiita 1996), and the deflection of a jet by 
an off-center collision with dense clouds of gas (de Young 1991) are a few 
other “environmental” issues investigated by hydrodynamic simulations. 

Jet simulations have to be continued long enough in time to account 
properly both for the long-term evolution of powerful extragalactic sources 
with typical life-times ~ 10® yr (corresponding to a distance of a few 10^ jet 
radii) and the observed deceleration of (initially relativistic) jets at distances 
beyond several 10^ pc (see Sect. 5.1). This requirement poses a severe numer- 
ical problem, because a very large computational domain and hence a large 
number of grid zones are necessary for such studies. Long-term 2D Newto- 
nian hydrodynamic simulations have been performed by Hooda et al. (1994) 
which cover a period corresponding to ~ 10® yr. They considered conical low- 
density jets propagating several hundred times their original radii with jet 
parameters believed to be suitable for powerful radio galaxies. However, be- 
sides being Newtonian, these simulations were concerned with what happens 
to jets when they cross pressure-matched interfaces between the interstellar 
and the intra-cluster medium, and not with the deceleration of the jets (see 
also Sect. 5.6). 

5.3 Morphology and Dynamics 

In a series of papers Norman and collaborators (Norman et al. 1982, 1983, 
1984, 1985) presented a comprehensive numerical study of pressure-matched 
{K = 1; Eq. (5.8)), axisymmetric, supersonic (Mb > 1) classical jets propa- 
gating into a homogeneous medium. Initial jet models were constructed by 
varying the beam Mach number, Mb, and the density ratio 7} (Eq. 5.7). The 
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study covered one decade in Mb {from 1.5 to 12) and three decades in 7 ] (from 
0.01 to 10; Fig. 1 of Norman et ai. 1983) including light or diffuse {t} < 1) and 
heavy or dense (r? > 1) jets. A detailed understanding of the morphology and 
dynamics of supersonic axisymmetric jets was derived from the study, which 
is summarized in the following. 

5.3.1 Overall Morphology, Supersonic pressure-matched light {r] < 1) 
jets consist of a supersonic beam of nearly constant diameter, which stays well 
collimated due to the presence of internal oblique shock waves and of plane 
and centered rarefaction waves (Figs. 5.3 and 5.4; see also Sect. 5.3.5). The 
beam ends in a contact discontinuity called the working surface (Blandford 
& Rees 1974), which separates ambient gas shocked by a leading bow shock 
from beam gas decelerated to the speed of the contact discontinuity by a 
trailing terminal shock configuration called a Mach disk. The Mach disk, the 
contact discontinuity and the intermediate beam cap gas form the head of 
the jet. 




Fig. 5.3. Morphology of an axisymmetric supersonic light jet propagating into a 
homogeneous ambient medium (see text for details) 



The propagation speed of the working surface determines the velocity of 
the jet, i.e., its advance through the ambient medium. The jet velocity should 
not be confused with the actual velocity of the material in the beam which 
is referred to as the beam flow velocity. 

Part of the shocked, high pressure beam cap gas is deflected sideways and 
flows backward along the beam, feeding the cocoon. The deflection process 
involves the repeated formation of a strong vortex at the jet head, which is 
then shed of the beam cap and becomes part of the backflow in the cocoon. 
Simultaneously, a new vortex is generated. The backflow in the cocoon is 
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Z-AXIS 



Fig. 5.4. Snapshot of the density distribution (logarithmic scale) of an axisym- 
metric, supersonic (Mab = 6), light (?/ = 0.1), pressure-matched jet propagating 
into a homogeneous ambient medium (from Kossl & Muller, 1988). The simulation 
was performed with a very high grid resolution of 100 zones per becim radius. The 
z-SLxis corresponds to the symmetry axis of the jet 



supersonic except for the regions between the shedded vortices. At the cocoon 
boundary the subsonic shear flow gives rise to Kelvin-Helmholtz instabilities, 
which eventually lead to a turbulent cocoon (see below). 

5.3.2 One Dimensional “Jets”. Except for the cocoon and the oblique 
shocks and rarefactions in the beam, the main structural features of a su- 
personic jet (bow shock, working surface and Mach disk) can be understood 
from a simple one-dimensional jet analogue. Consider the situation defined in 
the upper part of Fig. 5.5, where two pressure-matched states are separated 
by a diaphragm. The gas on the left side of the diaphragm has a supersonic 
velocity towards the right (i,e.,in positive x-direction), while the gas on the 
opposite side is at rest. The moving gas has a lower density than the gas at 
rest. This collision of a supersonic flow with a gas at rest exactly resembles the 
conditions on the symmetry axis of a light axisymmetric pressure-matched 
jet propagating into a quiet ambient medium. 

When the diaphragm is suddenly removed three waves are produced (see 
lower part of Fig. 5.5): a shock and contact discontinuity propagating in pos- 
itive x-direction, and a reflected shock propagating towards the left with 
respect to the contact discontinuity (but still in positive ^-direction!). The 
ambient gas passing through the shock is compressed, heated and acceler- 
ated to a finite velocity. The beam gas passing through the reflected shock is 
decelerated to the post-shock velocity of the ambient gas and compressed to 
the corresponding post-shock pressure. The contact discontinuity separates 
the very dense shocked ambient gas from the decelerated, less dense com- 
pressed beam gas. Obviously, the shock of the ‘TD jet” corresponds to the 
bow shock in an axisymmetric jet, while the contact discontinuity and the 
reflected shock resemble the working surface and the Mach disk, respectively. 
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reflected shock contact shock (bow shock) 

discontinuity 



Fig. 5.5. One-dimensional pressure-matched light “jet”: A supersonic flow collides 
with a homogeneous gas at rest. The gas in the flow is initially in pressure equi- 
librium with the gas at rest and its density is lower than that of the ambient gas. 
The panel at the top shows the initial setup, while the lower panel illustrates the 
qualitative nature of the flow at some later time 
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5.3.3 Propagation Speed. The propagation velocity of the jet through the 
ambient medium can be estimated using a one-dimensional argument, too. 
One equates the (one-dimensional) momentum flux of the beam gas and the 
ambient medium in a reference frame moving with the working surface. For 
pressure-matched jets an estimate of the Newtonian jet velocity, is then 
given by (Norman et al. 1983) 



v\ — 



i + v^ 



Ub 



(5.10) 



According to this formula, light jets (t^ < 1) propagate at a small fraction 
of the beam flow velocity and only very dense jets (7/ 1) have propagation 

speeds close to the beam velocity. Note that Eq. (5.10) has been obtained on 
the basis of the head-on momentum transfer between beam and ambient gas 
disregarding any multi-dimensional effects as, e.g.,the sideways expansion of 
the head of the jet. Hence, the estimate has to be considered as an upper 
bound of the actual jet propagation velocity. In the large sample of mod- 
els studied by Norman et aJ. (1983), dense jets are seen to propagate most 
efficiently. Their speed, however, never exceeds 80% of the one-dimensional 
estimate. Low Mach number light jets are even less efficient with a computed 
jet velocity of 40% of the ID estimate. Only hypersonic (Mb > 30), suffi- 
ciently dense (rj > 0.03) jets, as considered by Massaglia et aJ. 1996, seem to 
have propagation efficiencies equal one. 



5.3.4 Cocoon Prominence. The mass flux across the terminal shock 
(which when integrated over time gives to a good approximation the mass 
within the cocoon) is proportional to the beam fluid velocity relative to the 
Mach shock. Taking into account that the Mach shock propagates almost at 
the advance speed of the jet, Eq. (5.10) allows one to determine the relative 
prominence of the cocoon in classical jet models. 

Diffuse or light jets {rj < 1) propagating at a small fraction of the beam 
flow velocity create a pile-up of shocked beam material which feeds an ex- 
tended and turbulent cocoon. The prominence of the cocoon diminishes with 
increasing tj, i.e., when the jet becomes more and more dense, and when re- 
ducing the beam Mach number at a flxed beam density, i.e., when the jet 
becomes less and less supersonic. Dense or heavy jets ( 7 ? > 1), on the con- 
trary, have a small mass flux into the cocoon leading to morphologies in which 
the beam is the most prominent element (“naked beams”). 

5.3.5 Jet Collimation and Stability. The shear flow at the beam cap 
generates an intense vortex, which drives a strong shock into the beam. This 
shock is reflected back and forth between the axis and the beam boundary 
leading to a regular shock structure in the beam with alternating incident 
and reflected shocks (Fig. 5.4), where incident and reflected refers to the sym- 
metry axis. These internal shocks are responsible for the collimation of the 
beam, even if there is a substantial overpressure in the beam, compared to 
the cocoon (Falle 1987). The reason for this is that in an oblique shock only 




450 E. Muller 



the velocity component normal to the shock is reduced, while the tangential 
component is steady (Fig. 5.6 upper panel). This simple picture is compli- 
cated by the fact, that the shocks are boundary lines of plane rarefaction 
waves towards the axis and of centered rarefaction waves towards the beam 
boundary (Fig. 5.4). The centers of the latter are at the reflection points of 
the internal shocks at the beam boundary. In these waves the velocity vector 
is steadily deflected away from the axis (Fig. 5.6 lower panel; e.g., Landau & 
Lifshitz 1959). The net effect of shock collimation and centered rarefaction 
decollimation is an almost constant beam diameter. In the plane rarefaction 
waves inside the shock structure the magnitude of density and of pressure 
decreases, while the absolute value of the velocity increases. Thus the density 
and pressure have local minima just upstream and local maxima just down- 
stream of the reflection points at the axis and vice versa for the velocity. 

During the propagation of light jets direct interactions between Kelvin- 
Helmholtz instabilities in the cocoon and the beam can be observed. These 
interactions considerably disturb the internal structure of the beam, which 
becomes highly time dependent. The initially stationary internal shocks start 
propagating downstream and cause reflected shocks when they interact. Thus, 
shock waves propagating upstream can be observed all over the beam. In ad- 
dition, due to the interaction of the non-stationary internal shocks and the 
Mach disk, the latter is temporarily substituted by a weaker oblique internal 
shock, which decelerates the beam flow less than the Mach disk does. Conse- 
quently, the beam cap is slowed down and reaccelerated almost periodically. 
The resulting non-steady propagation of the jet leads to Rayleigh-Taylor 
unstable conditions at the working surface (similar to the non-steady propa- 
gation of the shock in a supernova envelope; see Sect. 3.4), which give rise to 
the growth of vortices at the beam cap. 

Whenever the beam cap is accelerated a smaller secondary bow shock 
is formed, which outruns the leading bow shock. This interaction produces 
a disturbance in the bow shock, which propagates away from the axis. Be- 
cause of the non-steady propagation of the jet higher and higher modes of 
the Kelvin-Helmholtz instability are excited in the cocoon. These Kelvin- 
Helmholtz instabilities give rise to a feed-back loop operating on a time scale 
determined by the sound crossing time of the cocoon, or respectively, by the 
shock propagation time along the beam. The feed-back loop is triggering and 
maintaining the observed beam “pumping” (j.e.,the non-steady propagation 
of the jet). It is probably also the cause for the continued decrease of the 
wavelength of the Kelvin-Helmholtz instability. The flnai outcome of all this 
is that at least the cocoon (i.e., the jet boundary layer) will eventually become 
turbulent. 

The situation is quite different for heavy jets. Their beams almost com- 
pletely lack internal structure, because of pressure equilibrium between the 
beam and its surroundings (i.e., the ambient medium, as there exists no 
cocoon). Therefore, the beam/cocoon interface of heavy jets is very sta- 
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centered 

incident rarefaction reflected 




Fig. 5.6. Collimation of a supersonic jet: The top panel shows that oblique shocks 
alone tend to focus the flow (tangential and normal components of velocity are in- 
dicated) . However, the combined action of oblique shocks and of pleme and centered 
rarefaction waves (lower panel) gives rise to an almost constant beam diameter. 
(Prom Kossl & Muller 1988) 

ble against the growth of pinch instabilities that would evolve into internal 
shocks. 



5.4 Relativistic Simulations 

5.4.1 Basic Equations. The equations of special relativistic hydrodynam- 
ics are a set of conservation equations (units are chosen such that the speed 
of light c = 1; see also Sect. 8 of the lecture by Le Veque) 

dP 
dt 



+ V{Dv) = 0 , 



(5.11) 
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i = l,2,3, 



(5.12) 

(5.13) 



_ + v{s^v) + {Vpy = o, 

^ + V[(r+p)u] = 0, 

for the conserved rest-mass density momentum density S = 

and energy density r, respectively. These variables, which are defined in a 

fixed frame, axe related to quantities in the local rest frame of the fluid 

through 



D = qW , (5.14) 

S - QhW\, (5.15) 

r = ghW^-p-D, (5.16) 



where g, p, v, W and h are the proper rest-mass density, the pressure, the 
flow velocity, the flow Lorentz factor and the specific enthalpy, respectively. 
The latter two quantities are given by 



W = 



1 

\/l - V ‘ V 



and 

V 

h ~ \ — , 

Q 



(5.17) 

(5.18) 



where e is the specific internal energy. 

An important difference between Newtonian and relativistic dynamics is 
the presence of a maximum velocity, i.e.,the speed of light in vacuum in the 
latter case. This implies that relativistic flows can no longer be scaled in space 
and time separately, but instead both scales are related by the speed of light. 
Hence, beyond K, rj and Mb (see Sect. 5.2.1), an additional fourth parameter 
(for a given adiabatic index) is required to completely specify a relativistic 
jet propagating into a homogeneous medium. Commonly, one chooses the 
beam flow velocity, (measured in units of c) as the additional fourth flow 
parameter. 

The existence of an additional flow parameter reflects the fact that a jet 
can be relativistic in two ways: in a kinematically relativistic jet the beam 
gas moves with relativistic velocity {W > 1), while in a thermodynamically 
relativistic jet the specific internal energy of the beam gas is large compared 
to its rest mass energy (/i ^ 1). Hence, the latter jets are called “hot” jets, 
and the former ones “cold” or “highly supersonic” jets (Marti et al. 1996). 
Note that the effective inertia of the flow increases in both cases, which 
is important for a qualitative understanding of the simulation results (see 
below). 

For large specific internal energies the local sound speed, Vs, approaches 
a maximum value of 

c 

73 ’ 



^ max _ j ^ _ 



(5.19) 
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where the second equality in Eq. (5.19) holds for an ultra-relativistic gas with 
7 = 4/3. The corresponding maximum proper sound speed is given by 



^max ^ max^ max _ 

S S S / ? 

yj 1 



(5.20) 



where (Tyj”^) Ws is the (maximum) Lorentz factor corresponding to the 
(maximum) local sound speed (t;^®^) Vs (Konigl 1980). 

These considerations show that relativistic effects arising from the pres- 
ence of large flow velocities are directly controlled by choosing an appropriate 
beam flow velocity t?b. Once is specified, the value of the beam Mach num- 
ber, Mb = v\y/vsy must be larger than 

which is the minimum Mach number for a flow with adiabatic index 7 mov- 
ing at speed v\y. Values of Mb near are obtained for “hot” jets. The 

minimum proper Mach number is given by 

which plays the role of the classical Mach number in two-dimensional, steady 
relativistic hydrodynamics, and which only depends on the beam flow veloc- 
ity. Note, that the second equalities in Eqs. (5.21) and (5.22) again hold for 
an ultra-relativistic gas with 7 = 4/3. 



5.4.2 Numerical Methods and Simulations. Until recently, severe nu- 
merical difficulties arising in the integration of the special relativistic hydro- 
dynamic equations (Eqs. (5.11) - (5.13)) had restricted the research of rela- 
tivistic jets to the stationary regime (Wilson 1987; Daly & Marscher 1988; 
Dubai & Pantano 1993; Bowman 1994). The major difficulty is caused by 
the strong coupling of the equations via the Lorentz factor, which contains 
all velocity components and appears in all equations (Norman & Winkler 
1986). Older numerical schemes were also less accurate and relied on artifi- 
cial viscosity to handle shock waves (e.g., Potter 1973). However, an accurate 
treatment of shocks is crucial in relativistic hydrodynamics, because there ex- 
ists no limit to the compression ratio across a shock as the upstream Lorentz 
factor tends to infinity. Thus, relativistic shocks resemble isothermal shocks 
in Newtonian hydrodynamics producing very thin dense regions, which are 
difficult to resolve adequately (Norman & Winkler 1986). 

The development of high- resolution shock-capturing finite volume schemes 
(see Sect. 3.7 of the lecture by Le Veque) and their application to special rel- 
ativistic flows has made it possible to overcome the numerical difficulties 
arising in the older methods (e.g., Marti & Muller 1994, 1996; Marti 1998). 
There axe two main reasons for this success. Firstly, modern schemes do not 
require any artificial viscosity. Secondly, and even more important, the new 
methods explicitly take into account the non-linear character of the set of 
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hyperbolic partial differential equations using exact or approximate (relativis- 
tic) Riemann solvers. Modern schemes are able to simulate multi-dimensional 
relativistic flows with Lorentz factors W 10^ (Le.,v ~ 0.9999 c) without 
numerical problems (Marti 1998). 

During the past five years a series of 2D and 3D hydrodynamic and mag- 
netohydrodynamic simulations of relativistic jets have been performed (van 
Putten 1993; Eulderink & Mellema 1994; Duncan & Hughes 1994; Marti et 
aL 1994, 1995; Duncan et al, 1996; Hughes et ai. 1996; Koide et aL 1996; Marti 
et ai. 1996; Komissarov & Falle 1996a, b; van Putten 1996; Marti et ai. 1997a, 
b; Nishikawa et ai. 1997). The collimation of relativistic spherical outflows by 
thick accretion disks was proven by Eulderink & Mellema (1994). Van Putten 
(1993) simulated a relativistic slab jet with a beam Lorentz factor W\y ^ 3.25. 
Although he computed the propagation of the jet for a distance of seven beam 
radii only, he was able to verify the formation of a Mach disk at the end of 
the relativistic beam. Marti et ai. (1994) and Duncan & Hughes (1994) per- 
formed longer simulations in cylindrical and planar symmetry of low beam 
Mach number models (i.e., hot jets). Both groups derived similar conclusions 
concerning the morphology of these relativistic models (see below). A highly 
relativistic {W = 22) and highly supersonic axisymmetric jet was simulated 
by Marti et ai. (1995). 

Marti et ai. (1997a) performed the most comprehensive parameter study 
of the morphology and dynamics of hot and cold relativistic axisymmet- 
ric jets up to now. They used a conservative shock-capturing finite volume 
scheme in cylindrical coordinates (r,z). Numerical fluxes at cell interfaces 
are calculated by means of an approximate Riemann solver that uses the 
complete characteristic information contained in the Riemann problems be- 
tween adjacent cell (Donat & Marquina 1996). The solver is based on the 
spectral decomposition of the Jacobian matrices of the relativistic system 
of equations (Font et ai. 1994). The spatial accuracy of the algorithm is im- 
proved up to second-order accuracy by means of a conservative monotonic 
parabolic reconstruction of the pressure, proper rest-mass density and flow 
velocity following the work of Colella & Woodward (1984; see Marti & Muller 
1996 for the explicit formulae). Integration in time is done simultaneously 
in both spatial directions using a total-variation-diminishing (TVD) Runge- 
Kutta scheme of high-order accuracy (Shu & Osher 1988). A one-dimensional 
Newton-Raphson iteration is used to compute in each time step the primitive 
variables from the conserved ones (see Marti & Muller 1996 

for details). The simulations covered the evolution in a region extending up 
to 50i?b In 2 :-direction and 7i^b in r-direction. The computational domain 
was covered by a uniform numerical grid consisting of 1000 x 140 zones, cor- 
responding to a spatial resolution of 20 zones per beam radius. In order to 
delay and weaken the effect of an erroneous, numerically caused partial re- 
flection of the bow shock at the top (outflow) boundary, the equidistant grid 
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wcis extended in radial direction by 28 geometrically spaced zones covering a 
region from r = 7 to r = 10.5 

Most simulations of relativistic jets have been performed using an ideal 
gas equation of state with a constant adiabatic index of 7 = 5/3 or 7 = 4/3, 
respectively. The effects of the inclusion of a variable adiabatic index on 
the internal structure of relativistic jets have been explored by Duncan et 
ai. (1996), who used a pair-plasma equation of state, and by Komissarov & 
Falle (1997), who considered a relativistic electron-proton gas. 

The superluminal knots of VLBI sources are widely believed to be shocks 
traveling along relativistic jets. Hughes et ai. (1996) have modulated the 
Lorentz factor of the inflowing material to model the propagation of these 
knots responsible for variability. Their radiative transfer calculations show 
that nested bow shocks resulting from the varying inflow speed appear a^ a 
series of axial knots for observers close to the line of sight, closely resembling 
VLBI maps. Gomez et ai. (1997) and Komissarov & Falle (1996a) constructed 
steady state jets which were perturbed by superimposing variations of the 
inflow velocity, and then computed the synchrotron emission from the jets 
assuming an ad hoc magnetic field with an energy density proportional to 
the gas pressure. This procedure allows one to study the combined effect of 
standing and traveling shocks on the appearance of a relativistic jet. The 
results show that the complex behaviour observed in many sources can be 
accounted for by this model (Gomez et ai. 1997; see also Sect. 5.7). 

Meanwhile, one has also begun to consider the effects of a non-homogene- 
ous environment on the propagation of relativistic jets. Plewa et ai. (1997) 
have simulated a relativistic jet propagating into an “atmosphere” whose den- 
sity gradient is inclined to the jet direction. This mimics an extragalactic jet, 
whose direction of propagation does not coincide with one of the symmetry 
axes of the galaxy from which it is ejected. In order to save computational 
resources Plewa et ai. (1997) have applied the technique of adaptive mesh 
refinement (AMR; see Sect. 5.3 of the lecture by LeVeque) in a relativistic 
jet simulation. 

First simulations of relativistic MHD jets have been performed by Koide 
et ai. (1996) using a simplified total variation diminishing (TVD) method 
and by van Putten (1996) using his own so-called weak smoothing method. 
Koide et ai. (1996) considered 2D slab jets which are injected into a magnetic 
field oriented parallel to the jet direction. Their results, obtained on a grid of 
20i?b X 20 Rh with a resolution of 4 zones per beam radius, show that both 
relativistic eflfects and the parallel magnetic field help to collimate the jet. 
The axisymmetric simulations of van Putten (1996) performed on a grid of 
512 X 128 zones revealed a stagnation point-nozzle-Mach disk morphology. 
It is produced by an oscillation of the M-'^ch dick at the “hot spot”, which 
may explain the correlated optical and radio emission in the quasar 3C 273. 

While writing this article a 3D variant of the 2D MHD simulation of 
Koide et al. (1996) was published by Nishikawa et ai. (1997). The relativistic 
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jet (VEb = 4.56) is again injected into a magnetic field oriented parallel to the 
jet direction. The cubic computational domain has an edge size of 20i?b and 
is covered by 101^ zones equivalent to a spatial resolution of five zones per 
beam radius. According to Nishikawa et ai. (1997) the resulting structure and 
kinematics of the 3D jet is noticeably different from that of the 2D planar 
one: the beam decelerates more efficiently, develops no internal structures, 
and the head speed is smaller. 

5.5 Morphology and Dynamics of Relativistic Jets 

Generally speaking, the gross morphological features found in Newtonian 
calculations (bow shock, beam, working surface, cocoon) are present in rel- 
ativistic jets, too. Important, but not surprising differences were revealed 
between hot {h ^ 1) and cold {W > 1) relativistic jets (Marti et ai. 1997a; 
Figs. 5.7 and 5.8). 

Hot relativistic jets have “naked” beams surrounded by very thin cocoons 
and more extended lobes near the head of the jet (Fig. 5.7). The Mach shock 
at the jet head is permanently present during the whole simulation giving rise 
to a very steady propagation of the jet. In hot jets the backflow appearing 
at the working surface is minimized or even non-existing. Moreover, hot jets 
almost completely lack internal structure, as was first pointed out by Marti 
at ai. (1994) and Duncan & Hughes (1994). The absence of shocks in hot 
beams is a consequence of the pressure equilibrium between the beam and the 
surrounding cocoon/shocked ambient medium. Hot jets are also very stable. 
Neither the reflecting mode nor the fundamental, or ordinary, pinch mode 
(typical of low beam Mach number flows; e.g., Norman et ai. 1984) seem to 
develop in hot jets (at least for the time scales covered by the simulations). 
This result is in accordance with a study of Kelvin-Helmholtz instabilities in 
the linear regime by Ferrari et ai. (1978), who found relativistic, hot jets to 
be unconditionally stable. 

Cold or highly supersonic jets display a very rich and complex structure 
within the beam and the cocoon (Fig. 5.8). They have beams with internal 
oblique shocks separating compressed and rarefied regions and collimating the 
flow. The angle formed by the oblique shocks with the jet axis decreases with 
the proper beam Mach number. Cold jets are further dominated by a thick 
cocoon which tends to become less prominent as the beam Lorentz factor 
increases. Vortex shedding at the head of the jet feeds the cocoon with vor- 
tices, which grow as they are advected upstream and eventually give rise to a 
turbulent thick cocoon (Fig. 5.8). The cocoon is overpressured with respect to 
the beam, the overpressure being roughly proportional to the corresponding 
beam flow Lorentz factor. The existence of such over pressured cocoons was 
already pointed out by Begelman & Cioffi (1989) and confirmed by Loken 
et ai. (1992) in the case of Newtonian hypersonic jets (see also Massaglia et 
ai. 1996). The overpressured cocoon helps to confine the jet during its early 
evolution and is probably of relevance for the formation of knots observed 
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Fig. 5.7. Color-coded contour plots of the logarithm of the proper rest-mciss density 
(top) and pressure (bottom) of a hot, relativistic jet (77 = 0 . 01 , Ub = 0.99, Alb = 
1.72, and 7 = 4/3). The maximum values are coded in white (log = 0.62; log p = 
0.50). Decreasingly smaller values are coded in green, bright blue, dark blue, red 
and the minimum values axe coded in black (logp = — 3.04;logp = —1.77). The 
jet has a “naked” beam and almost completely lacks internal structure. The beam 
is surrounded by lobes (near the jet head) instead of a cocoon and is in pressure 
equilibrium with the shocked ambient medium. (From Marti et aJ. 1997a) 




Fig. 5.8. Same as 5.7, but for a cold, relativistic jet (77 = 0.01, Ub = 0.99, A4b = 
6.0, and 7 = 5/3). The maximum values are log q = 0.63 and logp = —0.82, and the 
minimum ones are log p = 2.93 and logp = —4.17. The jet has a beam with internal 
oblique shocks and an extended turbulent cocoon. (Prom Marti et aL 1997a) 
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in large-scale jets of many powerful radio sources. In models with 7 = 4/8 
the contact discontinuity is positioned very close to the bow shock (Marti ct 
hL 1997a). This gives rise to bow shock perturbations causing changes in its 
overall shape, which are reminiscent of the “nose cones” observed in classical 
MHD jets (Clarke et aL 1989; Lind et al 1989). 

5.5.1 Propagation Efficiency. Similar to the classical case, an estimate 
of the jet velocity can be obtained by equating the momentum flux of the 
beam and tho ambient gas in the frame of the working surface (Eq. (5.10)). 
Because both, large specific internal energies (compared to c^) and relativis- 
tic velocities are encountered, the momentum flux balance yields (Marti et 
al 1997a) 

(5.23) 

for a beam in pressure equilibrium with the ambient medium. The symbol 
(') indicates that the corresponding velocities are measured in the reference 
frame of the working surface. The estimated (relativistic) velocity of the jet 
in the rest frame of the ambient medium, can be introduced 

in Eq. (5.23) taking into account that the following relation between primed 
and unprimed velocities holds 

H"i>'b = . (5.24) 

Then one has 



whidi after .some rearrangement gives (Marti et al. 1997a) 



(5.25) 



Vfi = , 

1 + 

where r/p is defined as 
vk = vrWI 

and is given by 
Qb^i'h 



m 



Qtn 



(5.26) 



(5.27) 

(5.28) 



Note that for fluids with non-relativistic internal energies tjr tends to r; and 
for those having both non-relativistic internal energies and non-relativistic 
velocities tends to 77 . 

In the classical case only dense (i.e.,77 ^ 1) jets have propagation speed 
estimates close to the beam flow velocity (Eq. 5.10)). However, for a rela- 
tivistic jet the condition ^ is fulfilled whenever y/^ ^ 1, i.e.,for 
ultra-relativistic and/or extremely hot jets. This result is easily understood, 
because in relativistic dynamics both the internal energy and the Lorentz 
factor contribute to the increase of the inertial mass density. 
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The study of Marti et aJ. (1997a) yielded estimates of the jet propagation 
speed in the range 0.17 < < 0.94. The corresponding propagation effi- 
ciency 5 = /V^ (i.e.,the ratio of the numerically computed and the 

analytically estimated velocity of a relativistic jet) was found to lie in the 
range 0.76 < S < 1.24. Efficiencies very close to one (i.e.,6 = 1.00 ± 0.02) 
are obtained for hot jets. The lack of internal structure within the beam and 
the insignificant or non-existing cocoon are responsible for the almost one- 
dimensional behaviour of these jets. Models which are highly supersonic and 
have a small adiabatic index (7 = 4/3) have efficiencies larger than 100%. 
This remarkable result is caused by an acceleration of those jets during an 
early stage of their evolution after the first oblique internal shock has formed 
within the beam. 

The propagation efficiencies of relativistic jets are significantly larger than 
those of Newtonian jets. Norman et aJ. (1983), for example, obtained efficien- 
cies in the range 0.49 < <5 < 0.90 for their comprehensive set of Newtonian 
jet models including light and heavy jets as well as highly supersonic ones 
(Mb < 12). The high efficiencies found by Marti et al. (1997a) are only com- 
parable with those of the hypersonic Newtonian models (Mb ~ 30 — 300) 
computed by Massaglia et al, (1996). 

5.5.2 Estimate of the Cocoon Meiss. Marti et al. (1997a) also derived an 
estimate of the total mass within the cocoon determined by the time integral 
of the mass flux across the terminal Mach shock. For a cylindrical flow of cross 
section 7tR\ and for a constant beam flow characterized by the parameters, 
7;, t?b, the estimate is 

•rricocoon = -KRXobv'^Wl^t' , (5.29) 

where V is the time measured in the reference frame of the Mach shock. In 
terms of quantities measured in the rest frame of the ambient medium the 
previous equation reads 

^cocoon ^ 7T-Rb^bM^b(wb - , (5.30) 



where Vu is the velocity of the Mach shock measured in the rest frame of 
the ambient medium. 

If one wants to compare different models at some epoch, say when the 
jet has propagated along a distance L, then for each model the appropriate 
evolutionary time to be considered is f = L/V^, Under these conditions the 
total mass within the cocoon scales as 



^cocoon 



OC T]Wh 



Vh - 



J 



(5.31) 



Taking into account that Um ^ and substituting the propagation speed 
of the jet by the estimate of Eq. (5.26), an extremely simple expression for 
the cocoon mass in terms of initial beam parameters can be derived: 

fj 

^cocoon OC (5.32) 
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Consistent with this expression is the result that hotter jets ( 77 r ^ rj) will 
have increasingly less prominent cocoons. Moreover, the fact that the cocoon 
mass does not depend on the beam flow velocity, and the fact that the density 
of the shocked beam gas increases with increasing beam Lorentz factors, 
explain the thinner cocoons of faster jets, 

5.5.3 Comparison with Newtonian Jets. Qualitatively speaking, the 
properties of relativistic jets resemble those of dense Newtonian jets, because 
relativistic effects enhance the effective inertia of the jet in a twofold way 
(see Sect. 5.5.1). As the enhanced inertia makes it more difficult to perturb 
relativistic jets, they are more stable than Newtonian jets. That is also the 
reason why the one-dimensional propagation speed estimates agree so well 
with the numerically obtained jet speeds, i.e., why relativistic jets propagate 
so efficiently compared to their Newtonian counterparts. 

When comparing Newtonian and relativistic jets quantitatively an ambi- 
guity arises. Relativistic jets are completely characterized by four parameters 
(three in the case of pressure-matched jets), while Newtonian jets require one 
parameter less (see Sect. 5.4.1). Hence, it is not a priori clear which parameter 
one allows to vary in the comparison. 

Marti et ai. (1997a) compared models which had the same rest mass den- 
sity ratio 77 and the same (classical) beam Mach number Mb, i.e., the beam 
flow velocity or the beam Lorentz factor was allowed to vary. This is a per- 
fectly reasonable choice, if one only wants to study the generic properties of 
relativistic jets. Komissarov & Falle (1996b, 1997) argue that this choice is not 
appropriate for a comparison between relativistic and Newtonian jets. If one 
holds all flow parameters fixed except for the Lorentz factor, as in the work 
of Martf et ai. (1997a), one compares jets with different power and thrust. 
Instead, Komissarov & Falle (1996b, 1997) suggest that one should compare 
Newtonian and relativistic jets with the same beam velocity and power, the 
latter quantity being the most relevant to extragalactic jets, Komissarov & 
Falle (1997) found that for their choice of parameters the behaviour of rel- 
ativistic and classical jets is very similar, except that the relativistic jets 
generate lighter cocoons because of their smaller mass flux. This difference is 
not very dramatic, however, and probably indistinguishable by observations. 



5.6 Long Term Evolution of Relativistic Jets 

Jets in powerful extragalactic radio sources (FRII radio galaxies and quasars) 
seem to have been feeding the lobes in these sources for periods as long 
as 10^ yr with kinetic powers in the range lO^^-lO'^^ergs"^ (Rawlings & 
Saunders 1991; Daly 1995). These powerful jets are supposed to be relativistic 
on parsec-scales (see Sect. 5.1). On larger scales, flux asymmetries between 
jets and counter-jets indicate that relativistic motion extends out to kpc- 
scales, although with smaller values of the overall bulk speeds (Bridle et 
aJ. 1994), making it necessary to look for adequate models of flow deceleration 
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between both scales. Finally, an effective deceleration should occur at the 
terminal hot spots for which advance speeds in the range 0.01c - 0.1c axe 
inferred (Liu et aJ. 1992; Daly 1995). 

Numerical simulations of jets typically cover the propagation of the jet 
for a few tens of initial beam radii, corresponding to time scales much smaller 
than those in real sources. Recently however, a first step towards a numerical 
study of the deceleration mechanism has been made by Marti et al. (1997b), 
who simulated the evolution of a relativistic jet modeling that of the powerful 
radio source Cygnus A. The initial conditions were chosen to be Vb = 0.986 c 
(i.e.jlFb = 6.0), 7/ = 3.4 X 10“^ and Mb = 9.0. An ideal gas equation of 
state with 7 = 5/3 was used. The jet was assumed to propagate through 
a uniform, static atmosphere in pressure equilibrium with the beam at the 
injection point. The model corresponds to a powerful jet (beam kinetic power 
10^^ erg s"^) when scaled according to a beam radius of iZb = 0.35 kpc and an 
ambient medium density = 10“^®gcm“^. From Eq. (5.26) the estimated 
hot spot propagation speed is - 0 .26 c. This is significantly larger than 
the hot spot advance speeds inferred from powerful radio sources (0.01 c to 
0.1 c; Liu et al. 1992). The evolution of the jet was simulated for a period of 
3 X 10® yr, which is about one tenth of the estimated spectral age of many 
powerful radio sources. 

The overall morphology found by Marti et al. (1997b) was typical of a 
light, highly-supersonic jet characterized by an extended overpressured co- 
coon and a beam with internal conical shocks. After a short one-dimensional 
phase the evolution is characterized by a strong deceleration followed by a 
long-term phase of steady evolution. During the deceleration phase the jet 
speed decreases from about 0.26 c to less than 0.10 c, because the beam flow 
is slowed down through internal shocks and broadened near the hot spot. The 
numerical resolution (4 zones per beam radius) was insufficient for capturing 
the process of mass entrainment (e.g., Norman 1996a). Because of the decel- 
eration of the jet, beam gas starts to inflate large lobes around the jet’s head, 
similar to those observed in many powerful radio sources (e.g.,in Cygnus A; 
Carilli et al. 1996). 

The supersonic propagation of the jet through the external medium drives 
a bow shock that pushes the ambient gas, causing the jet to propagate in a 
region of low density (the cavity). The evolution of the cavity pressure can 
be inferred from the evolution of the thermal energy within the cavity, , 
and from the cavity’s volume (Begelman & Cioffi 1989) 

. (5-33) 

where /j and Xcav are the length of the jet and the width of the cavity, 
respectively. 

With ^ ^5 /j oc t (assuming Vhs = 'i^hs) ^cav (approximat- 

ing the bow shock, which defines the cavity’s volume, as a strong shock) one 
finds from Eq. (5.33) that the pressure inside the cavity scales as pcav oa t~^. 
From their simulation, Marti et aJ. (1997b) obtain pcav oc the difference 
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being mainly due to the slower increase of the cavity volume as a consequence 
of the deceleration of the hot spot (/j a instead of cx i; see also Cioffi & 
Blondin 1992). 

5.7 Simulation of Parsec-Scale Jets 

Until recently, the simulation of parsec-scale jets and superluminal radio com- 
ponents has been restricted to simplified stationary models (Marscher & Gear 
1985; Hughes et aJ. 1989, 1991; Gomez et aJ. 1994a, b). The presence of ra- 
diation emitting flows at almost the speed of light enhances the importance 
of relativistic effects in the appearance of these sources (relativistic Doppler 
boosting, light aberration, time delays). Hence, models require the use of a 
combination of hydrodynamics and synchrotron radiation transfer when try- 
ing to explain the observations. Additional assumptions must be made to 
determine the radio emission (i.e., synchrotron emissivity and opacity) asso- 
ciated with a given flow. 

Modeling the radiation signature of superluminal components in parsec- 
scale relativistic jets Gomez et ai. (1997) performed hydrodynamic calcula- 
tions using the relativistic time-dependent code of Marti et ai. (1997a). In 
order to compute the radio emission from their hydrodynamic jet models, 
Gomez et ai. (1997) distribute the internal energy density among the elec- 
trons following a power law and assume that the magnetic energy remains 
a fixed fraction of the particle energy density. The emission is then calcu- 
lated by integrating the synchrotron transfer equations while accounting for 
relativistic effects such as Doppler boosting and light aberration (for more 
details see Gomez et ai. 1995). Komissarov & Falle (1996b, 1997) solve the 
transfer equations for the total intensity in the optically thin limit. In order 
to account for light- travel time delays within the jet, retarded emission and 
absorption coefficients are considered in the transfer equations. 

In the hydrodynamic models discussed by Gomez et ai. (1997) and by 
Komissarov & Falle (1996b, 1997) moving radio components are obtained 
from perturbations of steady relativistic jets. The jets are assumed to propa- 
gate through an atmosphere of decreasing pressure. This environment causes 
an expansion of the jets in accordance with the observed jet opening an- 
gles. Where pressure mismatches exist between the jet and the surround- 
ing atmosphere reconfinement shocks are produced. These shocks are either 
isolated (Komissarov &: Falle 1996b, 1997) or form a sequence (Gomez et 
ai. 1995, 1997). The energy density enhancement produced downstream from 
the shocks gives rise to stationary radio knots similar to those observed in 
many VLBI sources. Superluminal components are produced by triggering 
small perturbations in the steady jets which propagate almost at the beam 
flow velocity. Typical perturbation setups involve sinusoidal variations or flow 
discontinuities. Perturbations of the first kind were imposed by Mioduszewski 
et ai. 1997 (see also Hughes et aJ. 1996), who considered sinusoidal variations 
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of the jet Lorentz factor between 1 and 10, However, because of their large 
amplitude, the perturbations propagate only at barely relativistic speeds. 

In the simulated radio maps the perturbations appear as regions of en- 
hanced emission moving downstream at superluminal apparent velocities. 
Interactions of the perturbations with the underlying steady jet result in 
changes in the internal brightness distribution of the superluminal compo- 
nents (Gomez et aJ. 1997). This shows that the dynamic interaction between 
the induced traveling shocks and the underlying steady jet can account for 
the complex behavior observed in many extragalactic radio sources. A list 
of these observations and their possible interpretation within the hydrody- 
namic model of compact extragalactic radio sources can be found in Gomez 
et ai.(1997). 



6. Smoothed Particle Hydrodynamics 

It is commonly known that it is very efficient to compute multi-dimensional 
integrals using the Monte Carlo method. Based on this fact Lucy (1977) 
and independently Gingold & Monaghan (1977) developed a Monte Carlo 
approach to solve the multi-dimensional hydrodynamic equations. The ap- 
proach is called Smoothed Particle Hydrodynamics (SPH). The main advan- 
tage of the method is that is does not require a computational grid and that 
it is a Lagrangian method. Thus, in multi-dimensional calculations no com- 
putational power is wasted in simulating large voids which is often necessary 
when grid-based finite volume methods are used. The SPH algorithm is not 
too complex, a simple implementation typically requiring about 1000 lines 
of FORTRAN code and, moreover, the scheme is very robust. This robust- 
ness, however, implies some potential risk: If the code contains a severe bug, 
most finite volume schemes tend to crash after a few time steps, while SPH 
keeps on running and produces unphysical results (e.g*., superluminal speeds). 
Furthermore, for some problems SPH seems to be able to get very accurate 
results with a surprisingly small number of particles (~ 10^ or even less). 
This does not imply that SPH is able to solve supercomputer problems on 
workstations without a loss of accuracy, although some authors seem to argue 
in that direction (e,g.,the articles about SPH and the following roundtable 
discussion in Trease et aJ. 1991). On the other side, if indeed more than 10^ 
particles have to be used for certain problems, and if self-gravity has to be 
included, such a simulation would exceed even the capabilities of modern 
supercomputers, because like in any N-body algorithm the computational 
power required grows as the square of the particle number N. 

Three modifications of the original algorithm have increased the poten- 
tial of SPH remarkably: (i) the introduction of a variable smoothing length 
has considerably enhanced (for a given number of particles) the resolution 
capabilities (e.g., Hernquist & Katz 1989; Benz 1990); (ii) the use of indi- 
vidual time steps for each particle allows one to overcome global time step 
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limitations due to stability criteria like the Courant condition (Porter 1985; 
Jernigan & Porter 1989; Hernquist & Katz 1989); and (iii) the use of the 
hierarchical tree method to calculate the gravitational potential has reduced 
the original iV^-algorithm to a iV log A^-algorithm without sacrificing the free 
Lagrange nature of SPH (Porter 1985; Barnes & Hut 1986). 

Concerning the computational performance of SPH, several authors have 
shown that it is possible to vectorize both the hierarchical tree method and 
the SPH algorithm (Hernquist 1990; Makino 1990a; Barnes 1990; Haddad et 
al. 1991). Makino & Hut (1988) and Theuns & Rathsack (1993) have imple- 
mented the hierarchical tree method on a massively parallel computer. Thus, 
SPH can efficiently be used on different kinds of supercomputers, but the 
original simplicity of the algorithm has partially been lost. However, com- 
pared with multi-grid or adaptive-grid finite volume methods SPH is still 
remarkably simple. 

In the following the SPH algorithm is briefly reviewed (for a more thor- 
ough discussion see, e.g., Hernquist & Katz 1989; Benz 1990; Monaghan 1985, 
1992 and the references therein). A detailed description of a specific imple- 
mentation of SPH which uses the hierarchical tree method to handle self- 
gravity can also be found in Steinmetz & Muller (1993). The latter work also 
contains a set of demanding test problems (see Sect. 6.6) and astrophysical 
applications (see also, e.g., Steinmetz 1996b), which exhibit both the capabil- 
ities and limits of SPH. A related investigation has been performed by Davies 
et aJ. 1993, who compared the performance and accuracy of the SPH imple- 
mentation of Benz (1990) with that of the PPM multi-grid implementation 
of Ruffert (1992) when simulating stellar collisions. 

Recent developments concerning the SPH algorithm focus on the incorpo- 
ration of Riemann based solutions of the equations of hydrodynamics (Mon- 
aghan 1997), on the extension of SPH to ultra relativistic hydrodynamics 
(Chow & Monaghan 1997), on an improved prescription for the artificial vis- 
cosity in SPH (Balsara 1995; Watkins et al. 1996; Selhammar 1997a), on the 
use of nonspherical kernels in SPH (Fulbright et al. 1996; Shapiro et al. 1996; 
Owen et al. 1997; Selhammar 1997b), and on the stability of SPH (Swegle et 
al. 1995; Moris 1996). 

6.1 The SPH formalism 

Consider an ensemble of N particles with position r{ and mass m distributed 
in space according to the density g{r). Following Gingold & Monaghan (1977, 
1982) a smoothed estimate of the density g is given by 

^^(r)) = J g{r')W{r — r' ,h) 

» ^mW(r - rj,/i) , (6.1) 

where W{r,h) is the so-called kernel, a function strongly peaked around 
|r — r'l =0, and h is the so-called smoothing length. This approach can be 
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generalized to an arbitrary physical quantity A{r): 

{A{r)) = j d^r'A{r*)W{r -r\h) 

« ( 6 - 2 ) 

By a Taylor series expansion of the integrand in Eq. (6.2), one can show 
that A and its smoothed analogue differ by terms proportional to i.e,, the 
scheme is second-order accurate in space. Because the volume integrals extend 
over all space, one easily obtains the smoothed estimate of the time derivative 
of A as 

sW> = (^>. («) 

Using integration by parts spatial derivatives of a physical quantity can be 
transformed into spatial derivatives of the kernel, i.e., 



V{A{r)) = (VA(r)) 






(6.4) 



Note that in the case of a variable smoothing length the operator () does not 
commutate with d/dt and V, i.e., the time derivative and the gradient of the 
smoothing length occur in the equations above. Furthermore, it is not clear, 
if the scheme is then still second-order accurate in space. 

After these considerations it still remains to discuss the particular shape 
of the function W{r — r\h). Various types of spherically symmetric kernels 
have been suggested over the years (Monaghan 1985; Benz 1990). Among 
those the spline kernel (Monaghan & Lattanzio 1985), mostly used in current 
SPH-codes, yields the best results: 



W{r-r',h) = 






- |u2 + |u3 , 


0 < u < 1 


K2-«)^ 


1< ti < 2 


0, 


u > 2 



(6.5) 



with u = |r — r'|//i. It reproduces constant densities exactly in ID, if the 
particles are placed one a regular grid of spacing /i, and has compact support. 

Anisotropic kernels have been first applied by Bicknell & Gingold (1985). 
Since a few years they are investigated more thoroughly (Fulbright et ai. 1996; 
Shapiro et aJ. 1996; Owen et aJ. 1997; Selhammar 1997b). The advantage of 
such anisotropic kernels is that they can more fully adapt to flow problems 
with a large degree of anisotropy, as e.g., in the case of large-scale cosmological 
structure formation, and provide an optimal spatial resolution. The improved 
spatial resolution is achieved, however, at the price of having to deal with 
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tensor smoothing. Consequently, the particle forces are no longer spherically 
symmetric (Eq. (6.6)), and thus angular momentum conservation cannot be 
guaranteed. 

Given the smoothed estimates of physical quantities and their deriva- 
tives above there is no problem to find the SPH formulation of the hydrody- 
namic equations (e.g., Monaghan 1985; Benz 1990). However, if one applies 
the smoothing procedure in a straightforward way, the forces are not anti- 
symmetric, i.e.jthe force exerted by a particle i on another particle j is not 
equal to the negative of the force exerted by particle j on particle i. This 
violates the conservation of linear and angular momentum. Using adequate 
operator identities one can derive several SPH formulations of the hydrody- 
namic equations with anti-symmetric forces (Monaghan 1985, 1992). Start- 
ing, for example, from the Lagrangian of hydrodynamics (e.g., Soper 1976), 
one substitutes the matter density by its SPH approximation (Eq. (6.1)), and 
minimizes the resulting SPH Lagrangian with respect to the streamlines. This 
procedure yields the SPH equation of motion 

^ - 7 , ( 6 . 6 ) 

where Vi, d/dt and Qij are the gradient operator with respect to the coor- 
dinates of the z-th particle, the Lagrangian time derivative and the artificial 
viscosity tensor, respectively. The artificial viscosity is required in SPH to 
handle shock waves. It poses a major obstacle in extending SPH to relativis- 
tic flows, because it is not clear what form it should take (Israel 1987; Chow 
& Monaghan 1997). 

Because in Eq. (6,6) the (spherically symmetric) kernel W only depends 
on the absolute value of ri — rj (Note that this does not hold for anisotropic 
kernels!), both the pressure term and the gravity term consist of a superposi- 
tion of central forces. Consequently, both linear and angular momentum are 
(globally) exactly conserved. The gradient of the gravitational potential is 
given by 

S/i^{ri) ~ G^rrii f drr^W{r,h) 3 , (6.7) 

j=l 

where uj = \vi - Vj\. 

Using the first law of thermodynamics and applying the SPH formalism 
the energy equation can be written (in terms of the specific internal energy 
e) as 

p. ^ 

— = -^Y^m{vi-Vj) ■ViW{\ri~rj\,h) 

j 



-f 



( 6 . 8 ) 
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where the second term describes the heat generation due to the artificial 
viscosity. 

Monaghan & Gingold (1983) proposed the following form for the artificial 
viscosity tensor: 



f . (»*» - ‘Tj) ■ (wi - Vj) < 0 




0 , 



otherwise 



(6.9) 



with 



l^ij 



hij{vi-Vj){ri-rj) 



( 6 . 10 ) 



where Cjj, hij and Qij are the arithmetic means of the sound velocity c, the 
smoothing length /i, and the density respectively. The quantity 77 « O.lh is 
a smoothing parameter to prevent Qij from becoming infinite, if \vi — Vj\ ^ 0 
and rij ->■ 0. The parameters a and /? mimic a first and a second Navier- 
Stokes viscosity coefficient. Usually one uses a = 0.5 and /? = 1, but for 
problems involving strong shocks a choice of a = 1 and 0 = 2 is more 
appropriate to avoid post-shock oscillations. Because of the specific form of 
fiij the Monaghan & Gingold (1983) viscosity tensor is non-zero for shear 
flows, and hence does not guarantee local angular momentum conservation. 
An improved form of the artificial viscosity tensor has been proposed by 
Balsara (1995). In case of a compressive, shear-free flow (f.e., V * v ^ 0 and 
V X V = 0) it is identical to the artificial viscosity of Monaghan & Gingold 
(1983), but it is greatly reduced in shear flows. Consequently, unphysical 
transport of angular momentum can be reduced significantly. 

Even though the energy equation is not explicitly symmetric in i and j, 
one can show that nevertheless energy is conserved (Benz 1990). However, in 
one aspect the conservation of energy and momentum differ. If one integrates 
the equations in time by means of finite differences momentum is automati- 
cally conserved due to the anti-symmetry of the forces. In contrast, energy is 
only conserved if one uses time-centered velocities in Eq. (6.8), i.e., replacing 
the time-centered velocities by the respective forward extrapolated velocities 
leads to energy non-conservation. The conservation laws are also violated, 
if the hierarchical tree method is used to determine the gravitational po- 
tential, because the forces are no longer anti-symmetric for distant particles. 
However, the resulting errors are not large. They are of the order of a few 
percent after several thousand time steps in a simulation with a few hundred 
particles, and they become negligible if the number of particles exceeds a few 
thousand (Steinmetz & Muller 1993). 



6.2 Self-gravity 

Generally, self-gravity plays an important role in astrophysicaJ problems and 
seldomly can be ignored. In principle, it is quite simple to include self-gravity 
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in SPH. In order to obtain the gravitational force acting on a particle i one 
has to sum up the gravitational forces exerted by all other particles j on that 
particle. However, this process needs oc force evaluations. Two different 
approaches have been proposed which approximate the gravitational forces 
and which allow to handle large particle numbers efficiently. The probably 
most attractive approach is the hierarchical tree method (Appel 1981; Porter 
1985, Barnes & Hut 1986; Hernquist 1987), because it preserves one of the 
main advantages of SPH, namely being grid-less. In this method neighbour- 
ing particles are systematically grouped to clusters, neighbouring clusters to 
clusters of clusters and so on, until only one big cluster, the so-called root- 
cluster containing all particles is left. This hierarchy of clusters is organized 
in a tree. The center of mass (COM) and the monopole and quadrupole of 
each cluster is calculated. The force on a particle i is then obtained as follows. 
Beginning with the root-cluster one checks, whether the COM of the cluster 
is well separated from the particle f.e., 

— < 0 , ( 6 . 11 ) 

^ ik 

where Sk and rik are the size of the cluster k and its COM-distance to the 
particle respectively. Typically, the opening angle 0 = 0.8. It determines the 
accuracy of the scheme: Small values of 6 yield more accurate results, but also 
require more computational time, whereas a vanishing 0 corresponds to the 
original direct summation scheme. If the cluster is well separated, the force 
on particle i is calculated by using the monopole and quadrupole moment of 
the cluster k. Otherwise, the cluster k is divided into its sub-clusters and the 
previous procedure is repeated. The whole process requires a computational 
time proportional to A/' log A/'. 

To construct the tree, two different methods are commonly used. The 
Barnes & Hut (1986) algorithm (henceforth BH-tree) initially assumes a cube 
containing all particles of the system. This cube is divided into 8 sub-cubes, 
each sub-cube into another 8 sub-sub-cubes, and so on. This procedure is 
continued until each sub-cube contains only one or no particle (empty cubes 
are not stored). All particles inside a sub-cube of higher level are combined 
to a cluster. As noted by Hernquist (1990) both the tree construction and 
the tree walking as well as the force summation can be vectorized. The other 
approach is a binary tree based on the relationship of mutual nearest neigh- 
bours (Appel 1981; Porter 1985; Benz et ai. 1990). In a first step the nearest 
neighbour of each particle i is determined. Mutual nearest neighbours are 
replaced by a cluster, whose COM, size, monopole and quadrupole moment 
are calculated. In the next step the system of the still unpaired particles and 
the already constructed clusters is considered. Again mutual nearest neigh- 
bours are combined to clusters. The procedure is repeated until only the 
root-cluster remains. 

A comparison of these two tree construction methods is subject of a paper 
by Makino (1990b). The results can roughly be summarized as follows. Tree 
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walking and force calculation can be vectorized in both methods in a similar 
way. Therefore, the force calculation needs approximately the same time in 
both methods. The BH-tree seems to be a bit faster, while the accuracy of 
the binary tree is somewhat higher. However, whereas the tree construction 
can also be vectorized for the BH-tree, this is not possible for the binary 
tree, mainly because the time consuming part of getting the mutual nearest 
neighbour cannot be vectorized if the method of Benz et al. (1990) is used. 
Therefore, the tree construction time is comparable for both methods on 
workstations, but about ten times larger for binary trees on vector machines. 
However, in contrast to the BH-tree it is not necessary to construct the binary 
tree at every time step, because one can use the same tree for several time 
steps. This property of the binary tree becomes especially advantageous when 
a multiple time step scheme is used (see above). 

The fact that the tree is newly constructed at each time step leads to a 
discontinuous change of the error in the force determination. Therefore, hier- 
archical tree methods are inadequate when used together with higher-order 
time integration schemes. Moreover, the forces of distant particles are only 
approximated by a multipole expansion of the force of the cluster to which 
the particles belong. Upper bounds on the largest possible errors that are in- 
troduced into a tree-code calculation by use of various multipole acceptability 
criteria have been derived by Salmon & Warren (1994). Because of the finite 
multipole expansion, the forces between two particles i and j in a tree-code 
calculation are no longer anti-symmetric in i and j, and thus violate both the 
conservation of linear and angular momentum and the conservation of energy 
as well. However, these errors decrease with increasing particle numbers (for 
a detailed discussion see Hernquist 1987; Barnes & Hut 1988). Because tree 
construction and tree walking require much computational time on its own, 
sophisticated direct summation methods should be used for particle numbers 
less than a few thousand. 

6.2.1 The GRAPE Project. A completely different approach to perform 
large N-body simulations was chosen by Sugimoto and collaborators at the 
University of Tokyo (for an overview see Sugimoto 1993a, b): The calculation 
of one force interaction is a combination of a very few specific arithmetic 
operations: three coordinate differences, three squares, one sum, one square 
root, one inverse and three multiplications. Finally, the forces of all particles 
have to be added together. Furthermore, many of these operations do not 
depend on each other and can be done in a pipeline. Since the number of 
operations for the total force on one particle is the same for every particle, it 
is possible to parallelize it with a very good load balancing. 

Sugimoto et al. have designed a series of special purpose hardware boards 
GRAPE (GRAvity PipE) to calculate Eq. (6.7). Furthermore, a list of par- 
ticles within a sphere of a given radius hi of particle i is returned, too, 
which is especially attractive if one wants to run a SPH code on such a 
board. The board is connected with a standard workstation via a VME in- 
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terface. With a library it is possible to use GRAPE via subroutine calls in 
normal FORTRAN or C code. The prototype machine GRAPE-1 reached 
240Mflops in 1989. Meanwhile there exist three series of GRAPE boards: 
GRAPE-1 (240Mflops), GRAPE-IA (240Mflops), GRAPE-3 (15Gflops) 
and GRAPE 3A (5 Gflops/board) are low precision boards 1 % in the 
force), the precision, however, being sufficient for most astrophysicaJ appli- 
cations. GRAPE-2 (40Mflops), HARP-1 (ISOMflops), HARP-2 (2Gflops) 
and GRAPE-4 (l.lTflops) are working with 32 and 64 bit arithmetic and 
are designed to calculate molecular systems and specific stellar dynamical 
problems. Finally, GRAPE-2A (ISOMflops) and MD-GRAPE (4Gflops) 
are versions which include a user-loadable force look-up table that can be 
used for arbitrary central force laws (for more details see, e.g.,Hut 1997). 
Recently, an initial “point design study” of the feasibility of constructing 
hardwired Petaflops-class computational engines (GRAPE-6) has been com- 
pleted (Hut et al. 1997). 

In a series of publications, the Tokyo group and others have shown that it 
is indeed possible to perform large N-body simulations on such a board with 
a speed close to its peak performance (for a review see, e.g.,Hut 1997). Fur- 
thermore, simulations involving SPH and GRAPE have been performed, too 
(e.g.jSteinmetz & Muller 1995; Steinmetz 1996a, b). In these GRAPESPH 
simulations, the gravitational force and the neighbour list is obtained with 
GRAPE, while the evaluation of the hydrodynamic force and the solution 
of the equation of state is done on a workstation. Similarly, tree codes can 
be implemented on GRAPE, too. Again, the force evaluation is done on the 
board, but the tree construction and the determination of the interaction list 
has to be done on the front end. Thus, a powerful workstation is necessary 
in order to use a combination of a SPH and/or a tree code with GRAPE 
efficiently. 

6.3 Variable Smoothing Length 

In SPH the smoothing length h limits the resolution, i.e.,all structures on 
scales less than h are strongly smoothed out. In the early implementations of 
SPH the smoothing length was assumed to be fixed in space and time, and 
its size was determined by the mean density of the system in consideration 
(see also Gingold & Monaghan 1977, 1982). The upper limit of the density is 
then given when the center of all particles coincide, whereas the lower limit 
is obtained when all particles have a separation larger than 2/i, i.e., 

= mW(0, fi), 

N 

= ^mW^(0,/i). (6.12) 

i-1 

Thus, the maximum dynamic (density) range is N. However, this estimate 
is too optimistic. Firstly, one needs at least 10 particles to avoid very large 
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fluctuations in velocity, density and energy, and secondly the positions of the 
particles must not be degenerated. The latter point means that the particle 
positions have to differ, because otherwise only a density profile can result 
which is proportional to the kernel VP. Hence, the dynamic range is more 
realistically approximated by about A^/50 . . . iV/100. 

However, the resolution can be increased dramatically, if one assumes a 
spatially and temporally varying smoothing length h{r,t), which is deter- 
mined by the local instead of the mean density. Whereas the introduction 
of a time-dependent h is straightforward, there exist two possibilities to in- 
troduce a spatially variable h, the so-called gather and scatter interpreta- 
tion (e.g., Hernquist & Katz 1989): To guarantee the conservation laws of 
energy, momentum and angular momentum, a mixed interpretation is com- 
monly used. Benz (1990) has symmetrized the smoothing length by using 
Wij — VP(ri — Tj, {hi -h /ij)/2), while Hernquist & Katz (1989) symmetrized 
the kernel itself Wij = 0.5(VP(rj — Vj^hi) -1- W{ri — Note that if 

the smoothing lengths of two particles are very different, the corresponding 
symmetrized kernel of Benz (1990) has an effective width only half as large 
as that proposed by Hernquist & Katz (1989). Steinmetz & Muller (1993) 
have tested both methods and got similar results in most cases. However, 
the symmetrization of Hernquist & Katz (1989) shows a somewhat better 
damping of post-shock oscillations in the case of strong shock waves. 

Using a variable smoothing length can lead to a violation of the fluid con- 
servation laws (Hernquist 1993; Nelson & PapaJoizou 1994), because terms 
proportional toVh and dhfdt occur in the equation of motion (e.g., Hernquist 
& Katz 1989; Benz 1990). In extreme situations the violation of the conser- 
vation laws can reach the « 10% level (Hernquist 1993). Hence, SPH sim- 
ulations done with a variable smoothing length should be interpreted with 
caution. Test calculations, designed to compare the effects of including and 
ignoring the V/i terms (Nelson & Papaloizou 1994) show that their inclusion 
has no detrimental effects on the ability of SPH to model known problems 
(shock tubes, adiabatic collapse of cold gas spheres) with reasonable accuracy, 
but leads to a dramatic improvement on problems where SPH has shown to 
perform badly because of poor energy conservation (e.g, head-on collision be- 
tween polytropes). The SPH algorithm is also no longer a priori second-order 
accurate when a variable smoothing length is used (Benz 1990). 

One problem still remains, namely how does one determine h? Because h 
has the dimension of a length, it is quite natural to couple h to g{r) via 

where ^ is a paraimeter of order unity. However, then the problem arises 
that h must be known in order to compute g and vice versa. Miyama et 
al. (1984) have solved this problem using the density of the previous time step. 
Benz (1990) and Evraxd (1988) propose another approach, where Eq. (6.13) 
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is differentiated with respect to time and the continuity equation is used to 
rewrite the resulting right hand side: 

I = iw. . (6,14) 

The smoothing length h is then evolved in time similar to any other hydrody- 
namic quantity. Alternatively, one can couple h to the number of neighbouring 
particles A'neighbour within a sphere of size 2h and try to keep this number 
fixed (Hernquist Sz Katz 1989). However, this method is quite time consum- 
ing, because a complete tree- walk is necessary to determine ^neighbour- This 
is partially circumvented by allowing A^neighbour to vary within a given toler- 
ance. 

In their test calculations Steinmetz & Muller (1993) have found that the 
simple procedure of Miyama et aJ. (1984) to update the smoothing length 
leads to instabilities in many cases. The approaches of Benz (1990) and of 
Hernquist & Katz (1989) give quite good results for one-dimensional prob- 
lems, or when the particle positions almost resemble a regular grid. But in 
the case of a randomly sampled three-dimensional problem, the smoothness 
of h is not guaranteed and the following effects are observed: Small density 
fluctuations lead to fluctuations in h, which in turn amplify the density fluc- 
tuations and so on. This effect is also well known in finite-difference methods 
when adaptive grids are used. In order to avoid it, one should determine h 
from global trends in the motion of the fluid, and not from local features. 

In the method of Hernquist & Katz (1989), the number of neighbours 
A'neighbour is a free parameter. Thus it is an open question, whether the 
instabilities observed by Steinmetz & Muller (1993) occur for all choices of 
A^neighbour OF whether there exists a critical value of A^neighbour above which 
the method is stable. In the latter case it is unclear whether this value of 
A^neigbour is problem-dependent or not. Note in this context that Balsara 
(1995) has argued that the number of neighbours commonly used in SPH 
simulations (A^neighbour ^ 40) probably is too small for a correct description 
of the propagation of sound waves. Considering the above problems Steinmetz 
& Muller (1993) have proposed a more satisfactory algorithm to update the 
smoothing length, which gives remarkably good results. 

6.4 Time Integration, Initial Model 

In most SPH implementations, time integration is performed with a second- 
order accurate scheme, which consists of a multiple time step leap-frog inte- 
grator. Velocities can be predicted as described in Hernquist & Katz (1989). 
As noted by Anzer et aL (1987) and by Monaghan & Vaxnas (1988) this 
scheme has a very slowly growing instability, if the viscosity parameter 
a ^ 0.1 (Eq. (6.9)). However, this instability is not very troublesome, because 
in many astrophysical applications shock waves occur, i.e., artificial viscosity 
has to be switched on. The use of multiple time steps is very effective the 
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computational times being reduced at least by a factor of two. In the case 
that a highly condensed subregion evolves, the reduction in computational 
time can exceed a factor of ten or even more! 

To get an initial model one can use the method proposed by Lucy (1977). 
The N particles are sampled randomly according to the density distribution 
g{r) one wants to model. The particles are assigned a mass m and a specific 
energy according to the energy profile E{r). However, the number of par- 
ticles which currently can be used in a simulation are much too small and 
do not allow for a satisfactory approximation of g and E, i.e.,the presence 
of large pressure gradients in an initial model gives rise to random motion 
with large kinetic energies. Following Lucy (1977), one should then allow the 
random motion to relax into equilibrium by solving the equation of motion 
with an additional damping term. However, the relaxation method has the 
severe disadvantage that the star needs several sound crossing times to reach 
the equilibrium state. Especially in simulations with a large number of parti- 
cles the relaxation process can thus become as time consuming as the whole 
simulation itself! On the other hand, the same initial model is often used for 
a whole sequence of simulations, so this disadvantage is not very severe. For 
non-equilibrium initial models, one can skip the relaxation process provided 
the motion rapidly becomes highly dynamic, i.e., Ekin > |£^pot|, -Eint- For 
moderately dynamic systems, the stretched grid method is to be preferred 
(e.g., Hernquist & Katz 1989). 

It is important to note, that whatever method is used, one has to check 
that all physical quantities (in particular the energy and the velocity) of the 
initial model are smooth. Note that the relaxation process of Lucy (1977) 
guarantees such a smooth behaviour, because during the relaxation process 
the SPH algorithm is already applied to the different physical quantities. 
Otherwise, the simplest way to achieve this smoothness is to apply the SPH 
formalism to the initial model (see also Hernquist & Katz 1989; Benz 1990). 
Disregard of this point gives rise to unphysical oscillations close to steep 
density gradients and to an unstable behaviour (e.g., small-scale oscillations 
in energy and density) of almost hydrostatic models. 

6.5 Computational Aspects 

For SPH codes being based on the hierarchical tree method two strategies ex- 
ist to vectorize the algorithm. Hernquist (1990) proposes to vectorize over all 
nodes of a level in the tree during tree walking and over all force terms during 
the force calculation. The resulting typical vector length is short (between 
some tens and some hundreds). In Makino’s (1990a) method, all particles 
walk through the tree in parallel and one force term is added to all particles 
in parallel, too. This results in a much larger vector lengths (about N) but re- 
quires more bookkeeping and thus complicates the structure of the program. 
Test calculations by Steinmetz & Muller (1993) show that the computational 
time of both methods is practically identical on CRAY vector computers, as 
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these reach a large percentage of their maximum theoretical speed even for 
short vector lengths. Thus, because of its simplicity and greater flexibility, 
Hernquist’s (1990) method should be used to vectorize the force calculations. 
Barnes (1990) proposed an alternative vectorization method, which is based 
on the fact that nearby particles have similar interaction lists. Therefore, one 
can use the same interaction list for a group of particles. However, as a price 
for the increased degree of vectorization one has to compute a larger number 
of force terms, which makes the method less efficient than the methods of 
Hernquist (1990) or Makino (1990a). 

The typical CPU-time required for a binary tree based SPH simulation of 
a few times 10^ particles and 10^ time steps ranges from a few hours to up to 
20 hours on one processor of a CRAY-YMP (333 Mflops peak performance). 
The CPU- time scales almost linearly with the number of particles. 

Finally, concerning cosmological studies it should be noted that a colli- 
sionless dark-matter component can be included into a SPH simulation in 
an easy and consistent way as an additional kind of particles which interact 
purely gravitationally. 

6.6 How dissipative is SPH? 

One of the main advantages of one-dimensional Lagrangian finite difference 
methods is their small numerical diffusion, which results from the fact that in 
Lagrangian coordinates no advection terms are present in the hydrodynamic 
equations (see Sect. 2.2.1). This attractive property makes Lagrangian meth- 
ods optimally suited to handle problems involving chemical or nuclear kinet- 
ics (see Sect. 4). Unfortunately, this property is lost in multi-dimensional flow 
problems, because the appearance of shear flows and/or vortices repeatedly 
requires remapping the tangled and twisted grid. This remapping introduces 
some amount of numerical diffusion into most multi-dimensional Lagrangian 
schemes, which exceeds that of “state of the art” Eulerian schemes. 

Concerning the diffusivity of SPH the situation is more complicated. On 
the one hand according to the Monte- Carlo nature of SPH each physical 
quantity is determined by a weighted superposition of neighbouring parti- 
cles with both neighbours and weights changing from time step to time step. 
Moreover, the SPH particles can penetrate each other. These two effects can 
be looked upon as a kind of intrinsic remap of a free Lagrangian grid, which 
would imply a sizeable numerical diffusivity. On the other hand SPH should 
be less diffusive than Eulerian methods, because due to its Lagrangian na- 
ture mass, energy and velocity are particle properties which need not to 
be advected. This conception is supported by the large amount of artificial 
viscosity required in the shock tube calculations to damp-out post-shock os- 
cillations. However, shock tube calculations are usually performed with a 
one-dimensional version of SPH where the particles are initially on a regu- 
lar grid, i.e., particles hardly penetrate each other and the neighbours of a 
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Fig. 6.1. Central density of a pulsating n = 1.5 polytrope as a function of time 
normalized to the central density of the equilibrium model. The pulsation is initiated 
by a spherical homologous contraction of the equilibrium configuration by a factor 
of 0.8 in radius. In the upper calculation h is constant in space, while a spatially 
variable smoothing length was used in the lower simulation. Obviously, the increase 
in resolution due to a spatizilly variable smoothing length is accompanied by a larger 
amount of numerical diffusion. (From Steinmetz &: Muller 1993) 



specified particle remain almost unchanged. Thus, the size of the numerical 
diffusivity of SPH remains unclear. 

In order to clarify the diffusivity issue Steinmetz & Muller (1993) have 
examined the development of strong pulsations of a n = 1.5 polytrope. The 
pulsation is initiated by imposing a homologous contraction on an equilib- 
rium model. The resulting evolution of the central density is shown in Fig. 6.1 
for a simulation with a spatially variable (top) and a spatially constant (bot- 
tom) time-dependent smoothing length. Obviously, the pulsation amplitude 
decreases in both simulations. Approximately one half of the observed de- 
crease is due to the artificial viscosity required to stabilize the model, while 
the other half is due to the SPH inherent numerical diffusivity. Note that 
the radial pulsation does not decay into non-radial modes, but that kinetic 
energy is rather converted into thermal energy. Furthermore, at first glance 
it seems surprising that the model with the spatially constant smoothing 
length is less diffusive than that with the variable h. However, this can be 
explained. If h is spatially constant, the density contrast within the star can 
only be approximated by varying the number of particles as a function of ra- 
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dius, while for a spatially variable smoothing length both the number and the 
size can be adjusted. Thus, for a spatially constant h contributions of many 
more particles are required to approximate the central density than for a 
spatially variable h, i.e., a change in the number of neighbours is statistically 
more significant in the latter case. The resulting energy diffusion is mainly 
determined by the diffusion in the innermost parts of the star, because the 
thermal energy grows more than linear with density. This interpretation is 
proven by the fact, that when Steinmetz & Muller (1993) used an explicit 
adiabatic model where the pressure is directly computed from the density 
(i.e., no energy equation is used), the numerical diffusion of the scheme was 
reduced drastically. 

6.7 How Well Does SPH Treat Shocks? 

Astrophysical flows often involve shock waves. Thus, it is important that 
shocks are accurately described by the numerical code used for the simula- 
tion of the flow. The respective requirements on the code fall into two different 
categories. In the first category of problems the structure of the shock front 
itself is important, i.e., a high spatial resolution of the discontinuity is crucial. 
Typical examples for this category of problems are hydrodynamic flows with 
nuclear burning, where an insufficient spatial resolution can lead to quan- 
titatively very inaccurate and in some cases to even qualitatively incorrect 
solutions (see Sect. 4). In the second category of problems the time scales 
of processes triggered by the shock wave are comparable or larger than the 
hydrodynamic time scale. Then mainly an accurate description of the two 
states on both sides of the shock is important, while the structure of the 
discontinuity matters less. 

In early publications about SPH (Gingold & Monaghan 1982; Monaghan 
1985; Loewenstein & Mathews 1986) it was shown, that SPH is able to solve 
Riemann or shock tube test problems in a satisfactory way. However, in 
publications dealing with finite volume methods, Riemann problems are only 
looked upon as a kind of trivial test problems, which are not demanding at 
all. For this reason Steinmetz & Muller (1993) have tested their SPH code 
on a more severe hydrodynamic test problem, namely the interaction of two 
strong shock or blast waves (Woodward 1982), and have compared their SPH 
results with those obtained with the Piecewise Parabolic Method (PPM) of 
Colella & Woodward (1984). 

The initial state of the interacting blast wave problem consist of three 
constant states of a 7 = 1.4 gas in the interval 0 < rr < 1 with reflecting 
boundaries on both sides. The density is unity everywhere and the pressure 
is assumed to be 

1000, 0 < a: < 0.1 , 

0.01, O.K a: < 0.9 , 

100, 0.9 < X < 1 . 



P(x) = 



(6.15) 
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The initial conditions give rise to two shock waves propagating into the 
cold (i.e.jlow pressure) gas and two rarefaction waves moving into the hot 
(i.e.,high pressure) gas. The Mach numbers M of the two shocks are very 
large in the left and right shock the Mach numbers being M ^ 170 and 
M ^ 51, respectively. The reflection of rarefaction waves at the boundaries 
and at contact discontinuities, the weakening of the shocks caught by the 
reflected rarefactions, and finally the shock collision lead to a quite complex 
behaviour (for more details see Woodward 1982; Woodward & Colella 1984). 
Up to the time when the two shocks collide the density and velocity profiles 
can be described by a set of straight lines. 

In the SPH calculations of Steinmetz & Muller (1993) the initial state was 
obtained putting the particles on a uniform grid with spacing h. The bound- 
aries are simulated using “ghost-particles”, i.e.,for each particle extending 
across a boundary an analogous particle is created on the other side of the 
boundary having the same distance to the boundary, but the opposite veloc- 
ity. The smoothing length is time-dependent and spatially variable according 
to the method of Benz (1990; see also Eq. (6.14)). The evolution of the den- 
sity and the velocity field are shown in Fig. 6.2 (for N — 400) and in Fig. 6.3 
(for N = 2000). In both figures the times of the snapshots are chosen from 
Woodward & Colella (1984) and hence correspond to (dimensionless) times 
equal to 0.010, 0.016, 0.026, 0.028, 0.030, 0.032, 0.034 and 0.038, respectively. 

In the low resolution run the shock is quite accurately resolved at the 
beginning of the calculation (Fig. 6.2; snapshots 1-3). However, the transitions 
between constant states and rarefaction waves are rather smooth and not very 
sharp. Moreover, some oscillations in the constant state between the shock 
and the contact discontinuity as well as in the rarefaction waves are visible 
(Fig. 6.2; snapshots 2-3). But the states themselves are well represented and 
are even quantitatively in agreement with the results of Woodward & Colella 
(1984). All shortcomings of the low resolution run are removed in the high 
resolution run (Fig. 6.2; snapshots 1-3). Except for a very narrow region at 
the edges of the different states the profiles seem to be represented by a set 
of straight lines. 

At t = 0.28 (Fig. 6.2 and 6.3; snapshot 4) the two blast waves collide near 
X = 0.7 giving rise to a very narrow density maximum with « 30. In the 
low resolution run the maximum density is overestimated by about 15%, and 
in the high resolution run it is still too large by more than 5%. According 
to Woodward & Colella (1984) the density maximum should decrease and 
broaden with time. This behaviour is reproduced by the SPH results (Figs. 6.2 
and 6.3; snapshots 4-8), but they also show an additional erroneous density 
spike immediately to the left of the density maximum which is most clearly 
seen in the snapshots at late times (Figs. 6.2 and 6.3; snapshots 7-8). Also 
at late times one recognizes another incorrect density feature slightly to the 
right of the density minimum at x = 0.75 where the density undershoots. 
Both defects of the solution are caused by the usage of a variable smoothing 
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Fig. 6.2. Snapshots of velocity (top two rows) and density (lower two rows) for 
the Woodward double shock tube problem computed with SPH using 400 particles. 
The states to the left and to the right of shocks and contact discontinuities cLre ac- 
curately reproduced. However, the discontinuities themselves are strongly smeared 
out. Furthermore, some small post-shock oscillations are visible. (Prom Steinmetz 
& Muller 1993) 
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Fig. 6.3. Same as Fig. 6.2, but using 2000 particles. No post-shock oscillations 
are present and the discontinuities axe much more crisp. Only a small overshoot 
in density is still visible where the two shock waves collided. (Prom Steinmetz & 
Muller 1993) 
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length. The large density jump occurring during the shock collision yields 
a very small smoothing length for particles in the collision region, i.e.,the 
resulting intrinsic density of a particle ^intr = 2m/37r/i is extremely large 
(for a detailed discussion of this point see Loewenstein & Mathews 1986; 
Evrard 1988). 

In the low resolution run some small additional oscillations occur near 
the density minimum between the two maxima (Fig. 6.2; snapshot 8). All 
features of the solution of Woodward & Colella (1984) are present in the 
N ~ 400 calculation, although some of them are strongly smeared out, as 
e.g.jthe discontinuity at :r 0.85 (Fig. 6.2; snapshot 8). In the calculation 
using 2000 particles the oscillations are not present. All features are much 
more crisp, and the results are comparable with those of quite sophisticated 
finite volume methods like for example FCT (Flux-Corrected Transport). 
However, the quality of PPM cannot be reached. 

Thus, in conclusion, it is possible to handle even demanding hydrody- 
namic test problems involving strong shocks and their interaction with SPH 
provided a sufficiently large number of particles is used in the simulation. In 
my opinion, SPH and finite volume methods should be looked upon not as 
competing but as complementary methods to solve the hydrodynamic equa- 
tions each of which has its own merits and defects. 
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a-Network 435 
/9-decay 420 
/9-equilibrium 362 
7 -rays 372, 376, 405 

Aberration 179 
Absorption mean 227 
Absorptivity, invariant 181 
Abundance profile 437 

- corrupt 437 

Accretion shock 326,329,351, 
368-370, 386 

Accuracy see also Error, 351, 422, 424 

- adaptive grid and 256 

- assessment of 111 

- better thain second-order 78, 79, 81 

- global 45 

- loss of 74, 128, 138 

- monitoring 249 

~ timestep and 243, 249 
Acoustics see also Wave, acoustic 

- one’dimensional 7 

- two-dimensional 102 

Active gcilaetic nuclei (AGN) 437, 439 
Adaptive derivative 274 
Adaptive mesh refinement (AMR) 20, 
21,95,105-111,147, 428,455 
Adaptive transport theorem 275 
Adiabatic expansion 317, 330 
Adiabatic index, effective 272, 273, 
365 

Adiabats 42, 140 
ADM formalism 144 
Advection 282 
Advection equation 64, 235 

- for V • B 132 

- for unburnt gas 92 

- linear 7, 10, 22, 43, 50, 57, 64, 66 

- nonlinear 22, 23 

- two-dimensional 102 
Advection-diffusion equation 11 



Advection-reaction equation 85 
Aerodynamics 10, 80, 104, 291 
Alfven wave 122-124 
Algorithm 

- Reconstruct-Sol ve- Aver age (RSA) 
65 

Alpha particle chain 423 
Amplification faetor 235 
Amplification matrix 236, 237 
Arrhenius relation 90 
Artificial resistivity 127 
Artificial viscosity 56, 238, 239, 243, 
253-255, 263, 285, 335, 347, 453 

- coefficient 285 

- in SPH 464, 466, 467 

- length scale 297, 299, 329 

- tensor, in SPH 467 

Asymmetric time-filtering 300 
Atwood number (At) 416 
Avogadro number (A^a) 420 

Beam diameter 446, 450, 451 
Beam flow velocity 446-449, 452, 453, 
458, 460 

Beam scheme 80 
Beauty of flux-differencing 65 
Black holes 142, 146, 251, 353, 360, 
393, 438-440 

- collision of 143 
Boltzmann 

- equation 187, 206 

- number 178 

Bounce see also Core bounce 

- centrifugal 387-390 
Boundary conditions 76, 242 

- appEirent horizon 146 

- divergence cleaning and 129 

- extrapolation 83 

- fractional step method and 90, 101 

- grid 295 

- inner 284, 378, 381 
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- internal 238 

- Lagrangian 284 

- nonreflecting 83 
outer 284 

- outflow 83, 330, 454 

- periodic 83 

- radiation intensity 291 

- solid wall 84 

- SPH 477 

- symmetric 84 

Boundajy- value problem (BVP) 6 
Bow shock 446, 447, 461 

- nested 455 
perturbation 458 

- reflection at boundary 454 

- secondary 450 

Burgers’ equation 22, 23, 25, 55 
Burning 

- carbon 419 

- hydrogen 419 

- hydrostatic 409 

- nuclear 431 

- oxygen 365, 419 

- oxygen shell 427 

- Si-core 364, 365 

- Si-shell 365 

” silicon 424, 427 
Bursts 

- 7 ~ray 253 

- hard X-ray 253 
“ soft X-ray 253 

Carbuncle 118 
Cauchy problem 22 
Central density 360, 366, 391, 399, 475 
Centrifugal hang-up 402 
CFL condition 45, 50, 52, 59, 236, 237, 
243, 256, 263, 278, 350 
Chandrasekhar mass 361, 363, 364 
Chapman-Enskog theory 166, 177, 
223 

Chapman-Jouguet detonation see 
Detonation, Chapman-Jouguet 
Chapman-Jouguet velocity 410, 434 
Characteristic variables 27 
Characteristics 22, 25, 28 
Characteristics, method of 238 
Chemical evolution 272 
Chemical reactions 50, 90, 91, 337 
Closure problem 177, 268 
Closure relation 166, 269 

- radiative 288 
CNO cycle 408 



Cocoon 445, 446, 449, 450, 457 

- absence of 450, 459 

- KH-instability in 450 

- mass of 459, 460 

- of cold jets 456 

- of relativistic jets 456 

- overpressured 456, 461 
Cocoon prominence 449 
Codes 3 

- adaptive MHD 106 

- AMRCLAW 4, 108 

- CLAWPACK 4, 82, 104, 108, 139 

- PROMETHEUS 429, 432, 433 

- TITAN 259 

- VERA 230 

- WH80s 258, 259 

Colliding winds 20, 21, 108, 109, 117, 
118 

Column mass 241 
Combustion model 90 
Comet 106 

Composition interface 373, 376 
Computer 

- CRAY-YMP 474 

- CRAYT3E 348 

- GRAPE (GRAvity PipE) 469 

- memory 348, 351 

- parallel 348, 352, 431, 464 

- resources 348 

- vector 431, 473 

- workstation 308, 325, 337, 348, 353, 
463, 469, 470 

Conservation form 53, 54, 426, 432 

- in 2D 103 

- MHD equations in 120 
Conservation law 8 

- differential form of 10 

- in spacetime 133 

- integral form of 11 

- multidimensional 14 

- relativistic momentum-energy 137 

- weak form of 24 

- weak solution of 24 
Conservation of species 429 
Consistency 45, 52 

Contact discontinuity 15, 36, 41, 121, 
123, 141, 313, 352, 446, 447, 458 

- detection 352, 429 

- propagation of 352 

- slowly moving 433, 434 

- uncharacteristically wide 436 

- with SPH 478 
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Continuity equation 9, 133, 134, 136, 
209 

Convergence 45, 52 
Convexity assumption 22 
Cooling curve 316, 317 

- Kahn’s approximation 316 
Coordinate shock 146 
Coordinates 

- curvilinear 172 

- Eulerian 349 

- Lagrangian 349 
Core 

- homologous 363, 366, 367 

- inner (IC) 363, 366 

- iron 360, 361, 364, 388 

- outer 363, 366, 368 
Core bounce 251, 367, 387 

Core collapse 143, 360-363, 365, 368, 
373, 386-388, 393, 404 

- rotational 386, 389, 398 
-- axisymmetric 398,404 

- - non- axisymmetric 402-404 
Cosmic ray 

- galzictic 312 

- pressure 313, 318 
Coulomb radius 406 
Courant number 57, 59, 278 
Courant-Priedrichs-Lewy condition 

see CFL condition 
Covariant 184 
CPU time 278, 308, 337, 474 

- savings of 270, 378 
Crash of program 112, 463 
Cross section 406, 420 

- Coulomb 406 

- nuclear 407 

- Thomson 407 
Cygnus A 461 

Deflagration 91,411,412 
Delayed explosion 369, 376, 386, 415 
Deleptonization 362, 378, 380 
Democratic approach 293 
Density inversion 253 
Detonation 90, 409 

- cellular 410,411 

- Chapman-Jouguet 91,410,412 

- galloping 410 

- spinning 411 

- strong 410 

- wave 409-411 

- weak 410 

- ZND structure 91,411 



Diffusion 219 

- equilibrium 220, 224 

- first-order 222 

- multigroup 220, 248 

- nonequilibrium 220, 226 

- numerical 349, 474, 475 

- radiative 253 

- second-order 220, 224 

- zeroth-order 221 

Diffusion equation see Heat equation 
Diffusion limit 177, 216 

- dynamic 191-193, 197, 211 

- static 192, 193, 197, 211 
Diffusion terms, discretization of 281 
Dimensional splitting 103 
Discretization 347, 350 
Dissipation scale 416 
Distributed burning regime 414, 415 
Divergence cleaning 128, 132 
Domain of dependence 50, 51 
Doppler beauning 441 

Doppler effect 169 

- transverse 441 
Doppler shift 179, 186, 189 
Dust condensation 330 
Dust-induced /c-mechamism 331 

Eddington 

- approximation 253 

- factor, variable 166, 177, 249, 
269-270, 288, 291, 326, 332, 333 

- flux 163 

- tensor 166, 177, 219 
Effective aicceleration 375 
Efficiency, computational 351 
Effort 

- finding best procedure 351 

- observational and theoretical 438 

- simulation vs analysis 348 
Einstein equations 143, 147 

- hyperbolic formulation of 143 

- vacuum 144 

Elasticity, equations of 122, 127 
Electron capture (EC) 361-365, 388, 
420, 427 

- vs photo-disintegrations 365 
Elliptic constraints 

- in general relativity 144 

- in MHD 128 

Elliptic equation 5, 50, 128 
Emission coefficient 169, 171 
Emissivity 181 
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Enthalpy, specific 137 

- relativistic 137 
Entropy condition 35 
Entropy fix 61, 63, 64 

Equation of state (EOS) 10, 363-365, 
388, 389, 406, 432-436, 455 

- absence of 129 

- caloric 219, 243 

- global 130 

- isothermal 13 

- mechanical 219, 243 

- modification by burning 91 

- polytropic 40 

- soft 400 

- stiff 364, 366, 367, 388, 400, 401 
Error see also Unphysical solution 

- advection 299, 322 

- detonation caused by 432, 433, 435 

- discretization 334, 347 

- estimate 108 

- from non-smooth grid 258 

- from omission of 0(v/c) terms 191, 
198, 216 

- from rapid variation in opacity 248 

- global 45 

- in V • JB 128, 132 

- in shock speed 55 

- interpolation 306 

- local truncation (LTE) 45, 48, 240 

- mixing of species 428 

- non-conservation of species 429 

- numerical 351 

- programming 350 

- reaction network 406, 430 

- rounding 118 

- SPH 467 

- splitting 87, 88, 96, 103 

- start-up 114 

- tree code 469 

- truncation 347 

- with conservative energy equation 
335, 432 

Euler equations 1, 10, 15, 35 

- isothermal 13 

- multidimensional 102, 426 

- one-dimensional 13 

- relativistic 132, 135 

- one-dimensional 138 

- with radiation 96 

Euler expansion formula 274 
Explicit diflFerencing 422 
Explicit method 46, 219, 235, 240, 349, 
350 



Explosion mechanism 376 
External force 12, 346 
Extinction coefficient 169 

Fanaroff-Riley Class (FR) 440 
Finite difference method see Method, 
finite difference 

Finite volume method see Method, 
finite volume 

Flamelet regime 413, 414 
Floating point operations (flops) 348, 
352,470,474 
Flow 

- atmospheric 49 

- incompressible 6, 9, 50, 129, 131 

- multi-dimensional relativistic 141 

- quasi-stationary 96, 97 

- reacting 12 

- relativistic 60, 135 

- relativistic vs Newtonian 140 

- self-gravitating 6 
Flux 9 

Flux function 7, 8 

- Godunov 60 

- Lax-Wendroff vs Godunov 75 

- numerical 54, 56, 71 
Flux limiter 68, 72 

- for linear system 74 

- for nonlinear system 77 

- MC 74 

- minmod 74 

- superbee 2, 74, 114, 115, 139 

- TVD 73 

- van Leer 74 

Flux limiting, in radiation diffusion 
232 

Foliation of spacetime 143, 145 
Forward differencing 422 
4-flux 133 
4-force 

- external 194 

- radiation-matter 194 
4-momentum 136 

4- velocity 136 
FR I source 440, 442, 443 
FR II source 440-442, 460 
Frame 

- comoving 169, 170, 179, 199 

- inertial 185, 187 

- mixed 189, 193 

- noninerti 2 il 188, 199 

- proper 199 

Frequency coupling 169, 247, 251 
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Front fitting 254 
Front tracking 95, 105 
Front velocity of detonation 409 
Front, unresolved 248, 249 

Galaxies 

- M81 373 

- M87 439,440,442 

Garbage, numerical 238, 422 
General relativity see Einstein 

equations 

Genuine nonlinearity 22, 35 

- loss of 36, 124, 127 
GEO600 392 
Geodesics 188, 199 
Geometric source term 12, 102 
Ghost cells 82 
Ghost-paxticles 477 

Gibson scale 417, 418 
GIFT 431 

Godunov splitting 89, 90 
GRAPE project 469 
Gravitational force 18, 96, 100, 101 
Gravitational radiation 143, 379, 381, 
393-398, 402 

“ energy spectrum of 400, 401 
Gravitational waves 

- amplitude 397,398,401,404 

- astrophysics 143, 393 

- detector 143, 392 

- signal 380,388,389,393,396-405 

- total energy of 395-399, 402 
Grid 

- adaptive see also Adaptive mesh 
refinement (AMR), 248, 255, 351, 428 

- - convection 338 

-- expansion formula 257 

- - multidimensional 338 

- - time filter 259 

- transport theorem 257 

- comoving 349 

- concentration 258 

- coordinates 199 

- equation 258, 263 

- generation 104 

- implicit adaptive 255, 256 

- inertial 199 

- Lagrangian 199, 219 

- motion, memory of 300 

- nested, Cartesian 404 

- refinement see Adaptive mesh 
refinement (AMR) 

- remapping 474 



- resolution 292 

- rezoning 256, 349, 428 

- smoothing 

- - spatial 293 

-- temporal 294,313,327 

- spacing 

-- critical 435 
-- logarithmic 313 

- staggered 219, 277, 278, 280 

- staggered, in MHD 131 

- staggered, MAC 131 

- tangling 105, 349, 428, 474 

- tearing 313 

- velocity 256, 277 
Grid equation 294, 296 
Grid- aligned shock 117, 118 
Grid-orientation effects 103, 108, 116 

H 2 dissoziation 273 
Heat equation 5, 6, 11, 46-52 
Hiercurchical tree 464, 467-469 
Hodge projection 128 
Homogeneous of degree one 80 
Hot bubble 369, 370, 376, 380-382, 
396-398 

Hourglass Nebula 19, 20 
Hubble constant 358 
Hugoniot locus 31, 33-35, 39, 40 
Hydrodynamic approximation 346 
Hyperbolic equation 5, 7, 49 

- dissipation and 11,49 

- linear system of 27, 50, 56 

- non-strictly hyperbolic 124-127 

- nonlinear system of 1, 7, 32, 50, 77 

- strictly hyperbolic 27, 28, 32, 36, 

125 

- two-dimensional 101 
Idea 

- bad 189, 229, 239 

- brilliant 239 

- Gustavson’s 431 

- one-dimensional 1 04 

Implicit method 46, 47, 49, 52, 128, 
219, 235, 240, 307, 349, 350 
Implicit radiation hydrodynamics 412 
Inaccurate solution 432, 476 
Incorrect see Error or Unphysiccd 
solution 

Inertial range 417 
Initial model 

- atmosphere 304 

- Stax 304 
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Initial-boundary value problem 443 
Instability 

- Kelvin-Helmholtz (KH) 21, 413, 
416, 444, 445, 447, 450, 456 

- Landau-Darrieus (LD) 413, 417, 
418 

- MHD 129 

- numerical 49,116-118,422 

-- Rayleigh-Taylor (RT) 18, 253, 
374-377, 413, 414, 416-418, 450 
-thin shell 21,118 
Integral curve 35-40 
Intensity 

- invariant 180 

- mean 162 

- specific 161, 180,185,288 

- first angular moment 163 

- - second angular moment 165 

- - zeroth angular moment 162 
Interior solution 83 
Intermediate wave 123, 126, 127 
Interpolation 

- error 306 

- rational splines 272, 304, 307 
Interstellar medium 313 
Interstellar nebula 226 
Invariants 

- absorptivity 181 

- emissivity 181 

- intensity 180 

- Lorentz 179-181 

- relativistic 162 
Ionization 12, 273 

Isothermal equations 13, 33-38, 62, 
95,96, 100, 101 

Jacobi matrix 310, 336 
Jet 437 

- (kilo) parsec-scale 438 

- cold 452,456,457 

- collimation 438, 449, 455 

~ - by internal shocks 449-451 

- - perfect 443 

- extragalactic 345, 438, 439 

- heavy 446 

- highly supersonic 452 

- hot 452-454,456,457,459 

- kinematically relativistic 452 

- light 446 

- Newtonian vs relativistic 460 

- one dimensional 447, 448 

- one-sided 440-442 

- parsec-scale 462 



- propagation, efficiency of 458-460 

- relativistic MHD 455 

- simulation of 444 

- stability of 449 

- sub-parsec-scale 438 

- thermodynamically relativistic 452 
Jet boundary layer 450 

Jet velocity 446, 449 

- relativistic 458 

Kmlovitz number (Ka) 414 
Kinetic theory 

- Boltzmann equation 187 
-- photon 206 

- distribution function 162 

- relativistic 179 
Kirchhoff-Planck relation 170 
Kiss it goodbye 230 
Kolmogorov scaling law 417 

Lab frame 132 
Lagrange multiplier 130 
Lagrangian switch point 334 
Laing-Garrington correlation 442 
Lapse 145 

Laser-fusion pellet 212 
LIGO 392 

Linear degeneracy 36, 40, 122, 125 
Lorentz factor 135, 142, 440-442, 453, 
454, 456, 458, 463 
Lorentz invariant 179-181 
Lorentz transformation 179 
LTE (local thermodynamic equilibrium) 
170 

Lucy approximation 270, 332 
Luminosity 175 

Mach disk 446, 447, 454, 455 
Mach shock 449, 456, 459 
MacLaurin spheroid 387,391,392 
Magnetic pressure 120 
Magnetic tension 120 
Marquina flux 81 
Mass fraction 420 

Mass-equivalent enthalpy density 137 
Matrix 

- block penta-diagonal 259, 284, 310 

- penta-diagonal 295, 298, 304 

- Roe 62, 131 

- sparse 424, 431 
Maxwell equations 118 
Maxwell stress tensor 120 
MC-limiter 70 

Mesh see Grid 
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Method see also Scheme 

- adaptive-grid finite volume vs SPH 
464 

- Beam-Warming 66, 67, 69, 72-74 

- BGK 81 

- conjugate gradient 6 

- Crank-Nicolson 46, 47, 49, 52 

- Donor Cell 282, 352 

- Engquist-Osher 61 

- Eulerian 428, 432, 433 

- Euleriem PPM 435, 436 

- explicit 46, 219, 235, 240, 241, 349, 
350 

- finite difference 43 

- finite volume 43, 44, 52, 263, 275 

- - for relativistic flow 139 
-- multi- dimensional 103 

- flux limiter 72 

- - for linear system 74 
-- for nonlinear system 77 

- Flux-Corrected Transport (FCT) 

68 

--vs SPH 480 

- flux- difference splitting 80 

- flux- vector splitting 80 
-- Marquina’s 81,113,118 

- Steger- Warming 80,81 

- flux-vector splitting (SADIE) 352, 
432,433,435 

- forward Euler 79, 86 

- fractional step 84, 86, 94, 147 

- fractional step and combustion 94 

- fractiou 2 d step and gravitation 101 

- fractional step and radiation 96 

- free-Lagrange 349 

- Fromm’s 66, 67, 71-74 

- Godunov’s 1, 2, 42, 45, 57, 59, 60, 
64, 66, 74, 77, 96, 352 

-- generalization of 65 

- hierarchical tree 464, 467-469 

- high-resolution 2, 52, 64, 77 

- hybrid 68 

- implicit 46, 47, 49, 52, 219, 235, 240, 
241, 278, 307, 349, 350 

- for MHD equations 128 

- implicit hybrid 337 

- Lagrangian 428, 463, 474 

- Lagrangian PPM 435, 437 

- Lax-Friedrichs 56, 81 

- Lax-Wendroff 1, 2, 44, 45, 51, 
66-75, 85, 352 

- leapfrog 44, 45 

- Monte Carlo 463 



- multi-dimensional 103 

- multifrequency/grey (MFG) 230, 
247 

- multigrid 6 

- multistep 46 

- Newton’s 424 

- nonconservative 55, 111 

- nongrey 337 

- of characteristics 238 

- one-step 46 

- piecewise parabolic (PPM) 68, 282, 
352,429,432-434, 444 

-- vs SPH 476,480 

- positively conservative 112 

- PPM, relativistic 139 

- projection 129 

- pseudo-spectrail 404 

- Runge-Kutta 78, 79, 87 

- semi-discrete 78, 79 

- shock-capturing 52, 105 

- shock-capturing, multidimensional 
116 

- slope limiter 68-71 

- for linear system 74 

- - for nonlinear system 77 

- SPH 348,349,463 

- splitting see Splitting 

- time centered 280 

- total variation diminishing (TVD) 
69, 73, 74, 444, 455 

- trapezoidal 87 

- TVD Runge-Kutta 79 

- TVD, failure of 115 

- two-level 46 

- unsplit 84, 85, 104 

- upwind 43, 45, 58, 66, 70, 73 
-- conservative 55 

-- first-order 64,69,352 

- - monotonic 282 
MHD calculations 50 

MHD equations 106, 119, 126, 132 

- conservation form of 120 

- one-dimensional 121 

- stiffness of 127 
MHD processes 438 

MHD simulation of jets 445 
Micro-quasar 

- GROJ1655-40 440 

- GRS 1915-hl05 440 

Minimum network 406 
Minkowski diagram 135 
Minkowski metric 136 
Mixed-frame equations 189, 198 
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Mixing 371-373, 376 
~ complete microscopic 427 
Model Lagrangian transfer equation 
250 

Momentum density, monochromatic 
163 

Momentum flux, monochromatic 163 
Moral 

-be careful of frame 231 

- use implicit energy equation 241 
Multi-fluid calculations 428 

Naked beam 449, 456, 457 
Navier-Stokes equations 1, 11 

- incompressible (INS) 129 
Neutrino 

-- emission 360 

- heating 369-371,376,378 

- luminosity 380,381,397 

- mean free path of 362 

- observation 393 

- transport 348,361,369,380,381, 
388 

- trapping 362 

Neu trinosphere 362, 380, 382 
Neutron star see Stars neutron 
Newton-Raphson iteration 279, 307, 
309-310, 431, 454 
Nonphysical see Unphysical 
Normalized form of 

- HD equations 443 

- RHD equations 311 
Nose cone 445, 458 
Nuclear burning 405 

- onset of 328 

Nuclear reaction network 419, 421 

- coupling 425, 426 
Nuclear reactions 50, 90, 337 
Nuclear statistical equilibrium (NSE) 

361,425 

Nucleon number fraction 420 
Numerical dissipation 61 

Oblique shock 446, 449, 451, 456, 457, 
459 

Odd-even decoupling 117 
Oil recovery 127 
Opacity 

- Los Alamos 325 

- OP 269 

- OPAL 269, 272 

- Thomson 407 
Opacity coefficient 169, 171 



Opacity distribution function (ODF) 
229 

Operator splitting 281,350 
Optical depth 173 
Optically thin limit 216 
Oscillation of numerical solution 67, 
68,116, 243,283,301 
Outflow boundaries 83 
Overshooting 427 

p-characteristics 28 
p-family 33 
p-wave 31,122 
Parabolic equation 5, 6, 11, 49 
Parallelization 431 
Parameter study 386, 454 
Pfaffian derivative 207 
Ph.D. thesis 221 
Phase plane 30 

Photo-disintegration 365, 368, 369, 
420, 432 
Photon 

-momentum 178 

- creation 169 

- destruction 169 

- distribution function 162, 180 

- mean free path 166, 169, 170 

- number density 161 

- scattering 170 
Planar limit 204 
Planck 

- function 166 
mean 227, 266 

- spectrum 220 
Planetary nebulae 19, 437 

- MyCn 18 19, 20 

- NGC 6826 20 

Point concentration 292 
Poisson equation 268 
Poisson problem 5, 6, 128 
Poisson solver 129 
Poly tropic gas 10 
Post-processing 406 
Poynting flux 440 
Prandtl number (Pr) 415 
Pressure scale height 265 
Pressure wave 122 
Pressure, negative 112 
Projection method 129 
Prompt explosion 367, 369, 389 
Prompt shock wave 378, 386, 388, 389 
Protostellar collapse 265, 283 
Pseudo-time evolution 283, 303, 305, 
306, 313 
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Pseudoviscosity 238, 239 
Pulsation 

- of a poly trope 475 

- RR Lyrae stars 290 

- stellar 319 

Quad-tree structure 106 
Quadrupole approximation 393, 395 
Quadrupole formula 393-395 
Quasars 

- 3C264 443 

- 3C273 440, 455 

- 3C31 443 

Radiating fluid 194 
Radiating shock 95, 96 
Radiation 
-force 184 

- conductivity 222 

- energy density 162 

- energy equation 268 

-- comoving frame 203, 209, 211, 221 
-- lab freone 175,188 
-- Lagrangian 0(u/c) 204 

- mixed frame 190, 193 

- monochromatic, comoving frame 
202, 245 

- monochromatic, lab frame 175 

- monochromatic, Lagrangian 0{v/c) 
204 

- monochromatic, mixed frame 189 

- tensorial form 210 

- - total, comoving frame 245 

- energy flux, monochromatic 163 

- equilibrium 175 

- flux equation 268 

- force 176 

- isotropic 165 

- momentum equation 

-- comoving frame 203, 210, 211 
--lab frame 176,188 

- Lagrangian 0{v/c) 205 

-- mixed frame 190,193 

- monochromatic, comoving frame 
202, 245 

- monochromatic, lab frame 176 

- monochromatic, Lagrangian 0(v/c) 
204 

- monochromatic, mixed frame 189 

- tensorial form 210 

- total, comoving frame 246 

- pressure 165, 330 

- stress tensor 164 



- stress-energy tensor 182, 183, 194 

- temperature 227 

- thermzd equilibrium 166, 168, 170 

- transfer 12 

-~ isothermal limit 95 
-- stiffness of 13, 50, 95 

- transfer equation 288 

-- comoving frame 201,209 

- lab frame 172, 173 
-- Lagrangian 188 

Lagrangian 0{vjc) 203 
-- Lindquist’s 207 

- mixed frame 189, 192 

- spherical coordinates 173 

- - tetrad frame 206 

- viscosity see Viscosity radiation 
Radiation hydrodynamics, equations of 

194,211 

- adaptive 280 

- continuity 267 

- discrete 287 

- first order diffusion limit 223 

- gas energy equation 197, 212, 217, 
218, 267 

- integral form 276 

- mechaniccil energy equation 196, 
214 

- momentum equation 195, 212, 218, 
267 

- total energy equation 196, 215 
Radiative cooling 316-318 
Radio blob 440 

Radio emission of 

- jets 439 

- lobes 439 

- Pop. I stars 359 

- quasars 455 

- simulated jets 462 

- supernovae 359 
Radio galaxy 440, 442 
Radio knot 462 

Radio map, simulated 463 
Radio variability of jets 440 
Rankine-Hugoniot condition 26, 29, 
33,62 

Rankine-Hugoniot relations 239, 254 
Rarefaction wave 15, 26, 36-40, 59, 
410, 436 

- centered 446, 450, 451 

- plane 446, 450, 451 

- sonic 63 

- transonic 59-61,63 

- with SPH 477 
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Reconstruction 65 

- conservative 66 

- piecewise constant 65 

- piecewise linear 65, 69, 72 
-- piecewise quadratic 68 
Relativistic dust 137 
Relativistic hydrodynamics (RHD) 

133 

Relativistically hot gas 140 
Rest-mass energy 137 
Rest-mass energy vs internal energy 
138 

Reynolds number (Re) 

- SN-turbulence 413 

- white dwarfs 416 
Reynolds number, radiative 225 
Reynolds transport theorem 257, 275 
Ricci rotation coefficients 207 
Riemann invariant 37 

Riemann problem 

- all-shock solution 34 

- Burgers’ equation 25 

- Euler equations 18, 19, 40 

- general 17 

- general solution of 38 

- isothermal equations 35, 40 

- linear system 28, 30, 31 

- MHD equations 125 

- relativistic 139, 140 

- spherical 17, 107 

- with combustion 94 

- with radiation 94 
Riemann solver 263, 282 

- approximate 60 

-- all-shock solver 60,139 

- - for MHD 131 

- - general-relativistic 142 
-- HLLE solver 61,112,118 
-- Osher solver 61,429 

relativistic 139,454 
--- Roe solver 61, 63, 131, 429 
-- Roe solver, failure of 112, 118 

- generalized 97 
Ring-down 398 
Roe matrix 62, 131 
Rosseland mean 220, 222, 266 
Rotation see also Core collapse, 

rotational 

- stabilizing effect 387 
Rotational excitation 273 
Runaway, thermonuclear 409 

Scattering 169 



- Compton 170 
Thomson 170 

Scheme see also Method 

- backward Euler 246 
~ beam 80 

“ capturing- tracking hybrid 428 

- conservative, failure of 435 

- Crank-Nicholson 240 

- essentially non-oscillatory (ENO) 

79 

~ finite volume 347 

- high-resolution 453, 454 

- leapfrog 219, 236 

- MUSCL 68, 78, 444 

- rezoning 256 

“ Runge-Kutta TVD, high-order 454 

- shock-capturing finite volume 444, 
453, 454 

- von Neumann- Richtmyer 238, 239 
Schwarzschild metric 142 

Sedov phase 266 
Sedov-Taylor phase 315 
Self-gravity 265, 268, 285, 463, 464, 

467 

Self-similar solution 316, 366 
Sensitivity study 381 
Sharp- Wheeler model 416 
Shift vector 145 
Shock see also Front, 15 

- absence of, in hot beams 456 

- aspherical 389 

- collision 477, 479, 480 

- conical 461 

- convective flow 379 
“ detection 238, 254 

- energy of 367, 368, 375 

- entropy-violating 26, 63 

- fitting 238 

- formation of 23, 366 

“ formation vs propagation 376 

- general-relativistic 144 

- ideal 435 

- internal 449-451, 461 

- isothermal 95, 253, 453 

- non-steady propagation of 373, 450 
oblique ^ee Oblique shock 

- optically thick 255 

- overcompressive 126 

- prompt see Prompt shock wave 

- radiative 267, 327 
-reactive 409,411,434 
~ reconfinement 462 

- reflected 447, 449 
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- relativistic 135,141,453,455 

- reverse 313, 315, 373 

- slowly moving 113, 116 

- stalled 369,378,381 

- stationary 410 
~ strong 226 

- supercritical 253, 328 

- thickness of 239 

- undercompressive 126 

- weakening 368 

- with SPH 466 
Shock speed 25, 26 

- incorrect 55, 243 
Shock tracking 95, 105 

Shock tube problem 1, 2, 15-17, 63, 
298-302,431,433,476 

- relativistic 141 

- with SPH 478, 479 
Similcirity solution 16, 36 
Simulation 343, 345 

- ID vs 2D, shell burning 427 

- ID vs 3D 348 

- 2D vs 3D 

-- core collapse 404 

- grav. wave signal 405 

- - jets 444 

- - proto-neutron star 378, 396, 398 

- - RT instabilities 376 

- accretion shock 329 

- analysis of 348 

- colliding winds 20, 21, 108, 109 

- convection in hot bubble 380 

- core collapse 369, 371 
-- 3D 404 

--rotational 386-389,403 

- detonations and deflagrations 428 

- dust-driven wind 331 

- grav. wave signal 405 

- jets 443, 444 

- KH instabilities 445 

- mass accretion of red giant 110 

- MHD jets 445 

- N-body 469, 470 

- proto-neutron star 378-380 

- protostellar collapse 328 

- pulsation 475 

- - HdC star 324 
-- LBV star 325 

- - RR Lyrae stars 322 

- relativistic jet 454, 455 

- - cold jet 457 

- - hot jet 457 
--in 3D 455 



- RT instabilities 375-377, 416 

- supernova explosion 378, 383-385 

- supernova remnant (SNR) 313-319 

- supersonic jet 447 
Singularity 

- coordinate system 145-147 

- physical 145, 146 
Slope limiter 68-71, 73 

- for linear system 74 

- for nonlinear system 77 

- MC 73 

- minmod 69, 70, 73 

- monotonized centrgd-difference 
limiter (MC) 70 

- superbee 73 

- van Leer 73 

Smoothed Peurticle Hydrodynamics 
(SPH) 5ee Method, SPH 
Smoothing length 464, 470, 472 

- const, vs variable 475 
SNu 357 

Sod problem 2, 16, 63, 298-302 
Software see Codes 
Solar corona 127 
Solgir wind 106 
Solenoidal 128 
Sonic point 59, 64 
Sound speed 7, 40, 59, 102, 112, 
121-124, 167, 350, 452, 453 
Source function 174, 288 
Source term 12, 347 

- geometric 12, 102, 142 

- gravity 96 

- quasi-stationary 96 

- stiff 90, 94, 95 

- - combustion 90 
-- radiative 13,95 

Sparse Gaussian elimination 431 
Spcu-se linear system 431 
Special relativistic hydrodynamics 
438 

Specific abundance 421 
Spectral profile 227-230, 247 
SPH equation of motion 466 
SPH, accuracy of 465 
Sphericity factor 245 
Splitting 84, 86, 247 

- dimensional 84, 103 

- Godunov 89, 90 

- gravity and 97 

- Strang 89, 90, 100 

Splitting error see Error, splitting 
Stability 45, 52 
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- numerical 422 

- stiffness and 48 

- von Neumann analysis 235 
Stability analysis, von Neumann 240 
Stable 

- conditionally 236 

- unconditionally 240 
Stagnation point 59 
Stars 

- binary 20, 21, 108, 109 

- blue super-giant 360, 376 

- cataclysmic variable 437 

- Cepheids 248 

- compact 437 

- Hydrogen deficient carbon (HdC) 

323 

- Luminous Blue Variables(LBV) 325 

- - eruption of 326 
-neutron 142,251,353,360,393 

- - binding energy 360 
-- collision of 143 

- 0-star 178 

- Population I 359 

- presupernova 251 

- proto- 326-329 

- proto-neutron 369-371, 378-382, 
396-398 

- R CrB 324 

- red giant 110 

- red super-giant 360, 376 
RR Lyrae 248, 319 

- - light curve 323 

- light curve, Fourier decomposition 
of 322 

-- RRab 319 
- RRc 320 

- white dwarf 110 

- Wolf-Rayet 358, 359 
Start-up error 114 
State variables 8 
Stefan’s law 167 
Stefan-Boltzmann constant 167 
Stellar 

- evolution 214 

- interior 212 

- structure 

- boundary conditions 215 

- dynamical calculations 215, 224 

- equations of 214 

Stellar atmosphere dynamics 97 
Stencil 

- 5 point 280, 283, 295, 309 



Stiff source term see Source term, 
stiff 

Stiffness 47 
■ classic example 47 

- equation of state 364, 366, 367, 388, 
400,401 

- heat equation 49 

- hyperbolic equations 49, 50 

- MHD equations 127, 130 

- radiation transfer 50 

- reaction network 421 

stability of numerical methods and 
48 

Stiffness ratio 48 
Strang splitting 89, 90, 100 
Streaming limit 165, 192, 193, 197, 
210,211 

Stress tensor 15 

- Maxwell 120 

- radiation 164 

- stress-energy 134, 137, 138, 142 
-- material 194 

-radiation 182,183,194 

- stress-mass 134 
Sun 178 
Superluminal 

- components 462 
knots 455 

- speed 139, 140, 440-442, 463 
Supernova 

classification scheme 354 

- explosion 17,118,251 

- light curve 251, 253, 353, 355, 372 
progenitor 357, 358, 374, 412 

- - blue super-giant 360 
-- - red super-giant 360 
-- Wolf-Rayet 358,359 

- rate 357, 358 

- remnant (SNR) 312-319 

- - progenitor 313 

- SN 1981B 355 

- SN1984L 355 

- SN1985L 372 

- SN 1985P 372 

- SN1987A 353,354,358,360, 

371-373, 375, 376 

- SN1987F 372 

- SN 1987K 357 

- SN1987M 356 

- SN1988A 372 

- SN1988Z 354 

- SN1993J 357,373 

- SN 19941 356 
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- SNI 354,356,357 

- SN la 356, 358, 359, 408, 412-415, 
417, 418 

- SNIb 354-360,393 

- SNIc 354-360,393 

- SN II 354, 358-360, 376, 393 

- SNII-L 354,357,360 

- SNII-P 354,357,360 

- SNIIb 357 

- SNIIn 354 

- SNIIP 376 

- spectrum 356 

- Type II 251 

- Type I 353 

- Type II 353,372 
Synchrotron radiation transfer 462 

TAMA 392 
Tangent space 185, 188 
Test problems 

- advection 70 

- compositional step-function 351 

- detonation 434, 435, 437 

- grid adaption 297 

- radially-symmetric 103 

- reaction network 432 

- shock tube see Shock tube problem 

- simple combustion 90 

- simple EOS 432 

- Sod’s 2, 16, 63 

- SPH 464,471-473 

- step test 352 
Theorem of Harten 73 
Thermalization length 221 
Thermodynamics, first law of 197, 213 
Thermonuclear reactions 212 

Three body reaction 419, 420 
Three plus one formalism 143-145 
TIGA 393 
Time centering 279 
Time scale 

- acoustic vs dynamic 130 

- burning 407 

- convective 408 

- core bounce 367 

- core collapse 360, 387 

- diffusion of composition 407 

- diffusion of heat 407 

- dyncimical 264 

- dynamical vs collapse 360 

- dynamical, SN 251 

- electromagnetic 119 

- flow 427 



- formation of jets 345 

- free fall 327 

- grid 296 

- grid response vs flow 259 

- hot bubble convection 381 

- human life 345 

- ignition 407 

- neutrino diffusion 363 

- nuclear burning 214, 264, 427 

- nuclear vs convective 408 

- radiative vs dynamic 13, 95 

- reaction 91 

- reaction vs flow 50, 90, 91, 414 

- RT instability 375, 378 

- simulation vs dynamic 348 

- simulation vs time step 47 

- sound travel 408, 450 

- strong and em. reactions 361 

- sweep-up 314 

- thermal 264 

- variety of 47 
Time slicing 145, 146 

- geodesic 146, 147 

Time step restriction 312, 430 
Time step, individual 463 
Total variation 68 
Transverse derivative terms 104 
Triple umbilic point 125, 126 
Triple-a reaction 432 
Turbulent combustion 413-415, 417, 
428 

TVD region 73 

- second-order methods 74 
Two body reaction 420 
Two-temperature description 228 
Two-temperature means 228 

Ultra-relativistic limit 273 
Umbilic point 126 
Unphysical solution 

- artificial stellar pulsation 319 

- combustion 90 

- detonation 92, 93, 410, 434-436 

- from discrepancy EOS vs opacity 
274 

- magnetic monopoles 130 

- negative pressure 112, 432 

- shock speed 55 

- species mass fraction 429 

- with linearized Riemann solver 112 

- with nonrelativistic Riemann solver 
139 

- with SPH 463, 467 
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Unstable, unconditionally 236 
Upstream direction 43 
Upwind direction 43 

Vanishing viscosity approach 23 
Vanishing viscosity solution 24 
Variable smoothing length 463, 470, 
471 

Variables 

- dependent 199 

-- independent 199 

- physical 199 

- primitive, recovery of 139 

- state 8 

Vectorization 431,474 
Vibrational excitation 273 
VIRGO 392 
Viscosity 

- artificial see Artificial viscosity 

- bulk 225 

- numerical see Artificial viscosity 

- pseudo see Pseudo viscosity 

- radiation 220, 223, 224 

- vanishing see Vanishing viscosity 
Viscous 

- energy dissipation 285 

- length scale 286 

~ momentum transfer 285 

- pressure 238 

“ pressure tensor 285 

- profile 112 

- shock layer 1 
Vortex shedding 456 



Wall heating 115 
Wave 

- acoustic 100, 130, 234, 237 
-- spurious 114 

- Alfven see Alfven wave 
compressional 122 

- deflagration 91 

- entropy 

-- spurious 115 

- intermediate 123, 126, 127 

- magnetosonic 122 
--fast 122,123 
--slow 122,123 

- rarefaction see Rarefaction wave 

- shear 122, 141 

- Taylor 410 

- thermal 222 

- transverse 122 
Weak solution 24 
Working surface 446, 447, 450 

X-rays 170,372,405 

Young stellar objects (YSO) 437 

Zeldovich-von Neumann-Doering 
model (ZND) 411 
ZND structure 91-93 




